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1 
The Physical Basis of 
Quantum Mechanics 


At the present stage of human knowledge, quantum mechanics can be 
regarded as the fundamental theory of atomic phenomena. The experi- 
mental data on which it is based are derived from physical events that 
lie almost entirely beyond the range of direct human perception. It is 
not surprising, therefore, that the theory embodies physical concepts that 
are foreign to common daily experience. These concepts did not appear 
in the historical development of quantum mechanics, however, until a 
quite complete mathematical formalism had been evolved. The need 
for quantitative comparison with observation, which is the ultimate test 
of any physical theory, in this case led first to the formalism and only 
later to its interpretation in physical terms. . 

It seems desirable in introducing the subject of quantum mechanics 
to depart from the historical order and preface the mathematical develop- 
ment with a discussion of the physical concepts.! In this chapter we first 
review briefly the experimental background and the ideas of the old 


1 For a detailed study of the historical development, see M. Jammer, ‘“The Conceptual 
Development of Quantum Mechanics” (McGraw-Hill, New York, 1966). 
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quantum theory, then discuss the newer physical concepts of uncertainty 
and complementarity, and finally lay the groundwork for the formalism 
that will be developed in its most familiar form in Chap. 2. No attempt 
will be made to deduce the structure of the formalism from the funda- 
mental experiments; we shall try to make the theoretical development 
seem plausible rather than unique. The justification for the theory, 
then, will rest on the agreement between deductions made from it and 
experiments, and on the simplicity (in principle more than in practice) 
and consistency of the formalism. 


1DEXPERIMENTAL BACKGROUND 


Experimental physics prior to 1900 had demonstrated the existence of 
a wide variety of phenomena, which for the most part were believed 
to be explicable in terms of what we now call classical theoretical physics. 
The motions of mechanical objects were successfully discussed in terms ~ 
of Newton’s equations on both celestial and terrestrial scales. Appli- 
cation of this theory to molecular motions produced useful results in the 
kinetic theory of gases, and the discovery of the electron by J. J. Thomson 
in 1897 consisted in showing that it behaved like a newtonian particle. 
The wave nature of light had been strongly suggested by the diffraction 
experiments of Young in 1803 and was put on a firmer foundation by 
Maxwell’s discovery in 1864 of the connection between optical and elec- 
trical phenomena. 


INADEQUACY OF CLASSICAL PHYSICS 


The difficulties in the understanding.-of experimental results that remained 
at the beginning of this century were largely concerned with the develop- 
ment of a suitable atomic model and with the late discoveries of x-rays 
and radioactivity. However, there were also difficulties associated with 
phenomena that should have been understood but actually were not: such 
things as the spectral distribution of thermal radiation from a blaekbody, 
the low-temperature specific heats of solids, and the appearance of only 
5 degrees of freedom in the motion of a free diatomic molecule at ordinary 
temperatures. 

The beginning of an understanding of the second class of difficulties 
was made by Planck in 1900, when he was able to explain the blackbody 
spectrum in terms of the assumed emission and absorption of electro- 
magnetic radiation in discrete quanta, each of which contains an amount 
of energy E that is equal to the frequency of the radiation v multiplied 
by a universal constant h (called Planck’s constant): 


E = hy : (1.1) 
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This quantum idea was later used by Einstein in accounting for some of 
the experimental observations on the photoelectric effect. In this way 
the dual character of electromagnetic radiation became established: It 
sometimes behaves like a wave motion and sometimes like a stream of 
corpuscular quanta. 

At about this time, the existence of discrete values for the measur- 


able parameters of atomic systems (not only of electromagnetic radiation) : 


became apparent through Einstein’s and Debye’s theories of the specific 
heats of solids, Ritz’s classification of spectral lines, the experiment of 
Franck and Hertz on the discrete energy losses of electrons on collision 
with atoms, and (somewhat later) the experiment of Stern and Gerlach, 
which showed that the component of the magnetic moment of an atom 
along an external magnetic field has discrete values. : 


SUMMARY OF PRINCIPAL EXPERIMENTS AND INFERENCES 


The theoretical physics of the first quarter of this century thus contained 
two important inferences, obtained from the experiments and their inter- 
pretations, that had not existed in 1900: the dual character of electro- 
magnetic radiation and the existence of discrete values for physical quan- 
tities. The relations between the principal experimental conclusions and 
the theoretical inferences are shown schematically in Table 1; for a more 
detailed discussion and a bibliography, reference should be made to a 
book on atomic physics.! 

A third theoretical inference appeared in 1924 with the suggestion 
by de Broglie that matter also has a dual (particlelike and wavelike) 
character; he assumed that the relation between the momentum p of any 
particle and the length of the corresponding wave is? 

h 

» 3 (1.2) 
Up to that time all the evidence had indicated that matter was composed 
of discrete newtonian particles; in particular, sharp tracks of charged 
particles such as electrons and helium nuclei had been observed in expan- 
sion cloud chambers like that invented by C. T. R. Wilson in 1911. 
Shortly after this, however, Davisson and Germer (1927) and G. P. 
Thomson (1928) independently observed the diffraction of electrons by 
crystals and thus confirmed de Broglie’s principal supposition. 


1 See, for example, F. K. Richtmyer, E. H. Kennard, and T. Lauritsen, ‘Introduction 
to Modern Physics” (McGraw-Hill, New York, 1955); M. Born, ‘‘Atomic Physics’ 
(Hafner, New York, 1951); G. P. Harnwell and W. E. Stephens, ‘Atomic Physics” 
(McGraw-Hill, New York, 1955). : 

2 Equation (1.2) is also valid for light quanta, as may be seen by dividing both sides 
of Eq. (1.1) by the velocity of light, c; for a directed beam of light p = E/cand\ = c/». 
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Table 1 Relations between experimental 
interpretations and theoretical inferences 


Diffraction (Young 1803, Laue 1912) Electromagnetic waves 
Blackbody radiation (Planck 1900) 
Photoelectric effect (Einstein 1904) 


Compton effect (1923) Electromagnetic quanta 


Combination principle (Ritz-Rydberg 1908) Discrete 
Specific heats (Einstein 1907, Debye 1912) values for 
Franck-Hertz experiment (1913) : physical 
Stern-Gerlach experiment (1922) quantities 


2LITHE OLD QUANTUM THEORY 


What is now called the old quantum theory! was initiated by the work of 
Planck on blackbody radiation and was carried farther by Einstein and 
Debye. However, only after Rutherford’s discovery in 1911 that an 
atom consists of a small, massive, positively charged nucleus surrounded 
by electrons could the theory be applied to a quantitative description 
of atoms. 


BOHR-SOMMERFELD QUANTIZATION RULES 


The first step in this direction was taken by Bohr in 1913, when he made 
two postulates concerning the electronic or extranuclear structure of an 
atom. The first of these was that an atomic system can exist in particular 
stationary or quantized states, each of which corresponds to a definite 
energy of the system. Transitions from one stationary state to another 
are accompanied by the gain or loss, as the case may be, of an amount of 
energy equal to the energy difference between the two states; the energy 
gained or lost appears as a quantum of electromagnetic radiation, or as 
internal or kinetic energy of another system. The second postulate (in 
agreement with that of Planck and Einstein) was that a radiation 
quantum has a frequency equal to its energy divided by Planck’s 
constant h. . 

These two postulates by themselves provided some insight into the 
Ritz combination principle and the Franck-Hertz experiment. To obtain 
specific results for hydrogen, Bohr proposed a simple rule for the selection 
of the circular orbits that are to constitute stationary states: The angular 
momentum must be an integral multiple of h/2x. A more general quanti- 
zation rule was discovered independently by W. Wilson (1915) and by 


1 For a more detailed discussion than is presented in this section, see the books cited 
above, and L. Pauling and E. B. Wilson, Jr., ‘Introduction to Quantum Mechanics,”’ 
chap. II (McGraw-Hill, New York, 1935). 


THE ‘PHYSICAL BASIS OF QUANTUM MECHANICS : 5 


Sommerfeld (1916), thus making possible the application of Bohr’s postu- 
lates to a wider variety of atomic systems. This rule is applicable to 
hamiltonian systems in which the coordinates are cyclic variables; it 
states that the integral of each canonical momentum with respect to its 
coordinate over a cycle of its motion must be an integral multiple of h. 
The rule was applied with considerable success to the computation of the 
fine structure of hydrogen, the spectra of diatomic molecules, and other 
problems. 


PRACTICAL DIFFICULTIES 


The old quantum theory encountered practical difficulties in several 
different respects. It could not be applied to aperiodic systems, it pro- 
vided only a qualitative and incomplete treatment of the intensities of 
spectral lines, and it did not give a satisfactory account of the dispersion 
of light. Moreover, improvements in experimental techniques soon 
showed that there were problems, such as the rotational spectra of some 
diatomic molecules, to which the theory gave unambiguous but incorrect 
answers. 

The correspondence Serene was introduced by Bohr in 1923 in an 
effort to make use of the classical theory as a limiting case to infer some 
properties of atomic systems, especially the intensities of spectral lines. 
Although much was achieved in this way, it was clear in the early 1920’s 
that the quantum theory as it then existed was unsatisfactory. 


CONCEPTUAL DIFFICULTIES 


Quite apart from the practical difficulties outlined above, the old quantum 
theory failed to give a conceptually satisfactory account of the funda- 
mental phenomena. It was difficult to understand why the electrostatic 
interaction between a hydrogen nucleus and an electron should be effective 
when the ability of the accelerated electron to emit electromagnetic radia- 
tion disappeared in a stationary state. The mechanism of emission and 
absorption of radiation in transitions between stationary states was 
obscure. The quantization rules were arbitrary even when they were 
most effective. And the assumption of a dual character for light (particle- 
like on emission and absorption and wavelike in transit) seemed to be 
self-contradictory. 

In order to illustrate the conceptual difficulties and the way in which 
they are dealt with by the new quantum mechanics, we consider in some 
detail a simple diffraction experiment, which is illustrated schematically 
in Fig. 1. A light source S illuminates a diaphragm A in which two slits 
are cut. -A diffraction pattern appears at a photosensitive screen B, 
and the ejected photoelectrons are most numerous at the diffraction 

peaks. Here we have the radiation behaving as a wave during its passage 


poseae ote 
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Fig. 1 A diffraction experiment in which light from S passes 
through the two slits in A to form a diffraction pattern at B. 


from source through slits to screen, but behaving as a stream of light 
quanta or photons when it ejects electrons from B. We now know that a 
similar experiment could be set up with matter instead of radiation. The 
diffraction pattern of electrons scattered from a crystal (analogous to the 
slits in A) may be detected as a distribution of electron tracks in a Wilson 
cloud chamber (analogous to the screen B), so that the wave and particle 
aspects of matter appear in the same experiment. 

In the situation illustrated in Fig. 1, we might at first suppose that 
the diffraction pattern is due to an interference between different photons 
passing through the two slits, thus explaining the observations entirely 
in terms of the particle picture. That this is not a sufficient explanation 
may be shown by decreasing the intensity of the light until an average of 
only one photon at a time is in transit between source and screen. The 
diffraction pattern still appears as the distribution of the large number of 
photons accumulated over a sufficiently long time. Thus we must con- 
clude that diffraction is a statistical property of a single photon and does 
not involve an interaction between photons. From the point of view of 
the particle picture, we may then ask how it is that a stream of inde- 
pendent photons, each of which presumably can go through only one of 
the slits, can produce a diffraction pattern that appears only when both 
slits are open. Or to put the question in another way, how can the 
presence of a slit through which a photon does not go prevent that photon 
from reaching a part of the screen it would be likely to reach if that slit 
were closed? 


QUANTUM-MECHANICAL VIEWPOINT 


In this question is implicit the assumption that the photon actually does 
go through a particular one of the two slits. This assumption is natural 


é 
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from the point of view of the classical theory or the old quantum theory, 
since these theories regard a photon or other particle as having a definite 
and determinable position at each instant of time. The quantum 
mechanics, however, discards this assumption and asserts instead that 
the position of a photon has meaning only when the experiment includes 
a position determination. Moreover, this part of the experiment will 
affect the remainder of the experiment and cannot be considered sepa- 
rately. Thus from the point of view of quantum mechanics, the question 
asked in the preceding paragraph is without meaning, since it assumes 
that the photon goes through a particular one of the two slits (thus 
making it possible to close the other slit) when there is no provision in the 
experiment for determining through which slit the photon actually goes. 
The quantum mechanics resolves the situation by telling us that the 
diffraction pattern is destroyed if a sufficiently careful attempt is made 
to determine through which slit each photon passes (see Sec. 4). We 
must then be prepared to forgo the customary mental picture of a photon 
(or an electron) as a classical particle that has at each instant of time a 
position that can be determined without damage to diffraction patterns 
of the type discussed here. Thus classical causality, which requires 
that the motion of a particle at any time be uniquely determinable from 
its motion at an earlier time, must also be abandoned. The new theory 
that is forced upon us in this way is so successful in other respects as well 
that, at the present state of knowledge, we must regard such classically 
incomplete descriptions as a fundamental property of nature. 


3CIUNCERTAINTY AND COMPLEMENTARITY 


Before presenting a more quantitative discussion of the diffraction experi- 
ment outlined in Sec. 2, we consider two principles that express in qualita- 
tive terms the physical content of the theory of quantum mechanics. 
We restrict ourselves here to a discussion of their meaning and give 
arguments for their validity in Sec. 4. 


UNCERTAINTY PRINCIPLE 


The first of these is the uncertainty principle, developed by Heisenberg? 
in 1927. According to this principle, it is impossible to specify precisely 
and simultaneously the values of both members of particular pairs of 
physical variables that deseribe the behavior of an atomic system. The 
members of these pairs of variables are canonically conjugate to each other 
in the hamiltonian sense: a rectangular coordinate x of a particle and the 
corresponding component of momentum pz, a component J, of angular 
momentum of a particle and its angular position ¢ in the perpendicular 


1W. Heisenberg, Z. Physik 48, 172 (1927). 
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(xy) plane, the energy E of a particle and the time ¢ at which it is meas- 
ured, etc. Put more quantitatively, the uncertainty principle states that 
the order of magnitude of the product of the uncertainties in the knowl- 
edge of the two variables must be at least Planck’s constant h divided by 
2r(h = h/2et = 1.0545 X& 10-2” erg-sec),! so that 


Ar: Ap, Dh (3.1) 
Ae: AJ, >h (3.2) 
At: AE >i (3.3) 


The relation (3.1) means that a component of the momentum of a 
particle cannot be precisely specified without loss of all knowledge of the 
corresponding component of its position at that time, that a particle 
cannot be precisely localized in a particular direction without loss of all 
knowledge of its momentum component in that direction, and that in - 
intermediate cases the product of the uncertainties of the simultaneously 
measurable values of corresponding position and momentum components 
is at least of the order of magnitude of h. Similarly, Eq. (3.2) means, 
for example, that the precise measurement of the angular position of a 
particle in an orbit carries with it the loss at that time of all knowledge 
of the component of angular momentum perpendicular to the plane of 
the orbit.2 Equation (3.3) means that an energy determination that 
has an accuracy AE must occupy at least a time interval At ~ h/AE; 
thus if a system maintains a particular state of motion not longer than 
a time At, the energy of the system in that state is uncertain by at least 
the amount AE ~ h/At, since At is the longest time interval available for 
the energy determination. The smallness of h makes the uncertainty 
principle of interest primarily in connection with systems of atomic size. 

As we shall see in Sec. 12, the uncertainty principle may be obtained 
directly from the mathematical formulation of the theory, and this is 
actually the way in which it was first obtained by Heisenberg. 


COMPLEMENTARITY PRINCIPLE . 


In order to understand the implications of the uncertainty principle in 
more physical terms, Bohr’ introduced the complementarity principle in 
1928. This principle states that atomic phenomena cannot be described 
with the completeness demanded by classical dynamics; some of the 
elements that complement each other to make up a complete classical 
description are actually mutually exclusive, and these complementary 


1 E. R. Cohen and J. W. M. DuMond, Rev. Mod. Phys. 37, 537 (1965). 

* Because of the cyclic nature of ¢, Eq. (3.2) merits further attention ; see M. M. Nieto, 
Phys. Rev. Letters 18, 182 (1967). 

*N. Bohr, Nature 121, 580 (1928); ‘“‘Atomic Theory and the Description of Nature,’’ 
especially pt. II (Cambridge, London, 1934); Phys. Rev. 48, 696 (1935). 
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elements are all necessary for the description of various aspects of the 
phenomena. From the point of view of the experimenter, the com- 
plementarity principle asserts that the physical apparatus available to 
him has such properties that more precise measurements than those 
indicated: by the uncertainty principle cannot be made. ; 

This is not to be regarded as a deficiency of the experimenter or of 
his techniques. It is rather a law of nature that, whenever an attempt is 
made to measure precisely one of the pair of canonical variables, the other 
is changed by an amount that cannot be too closely calculated without 
interfering with the primary attempt. This is fundamentally different 
from the classical situation, in which a measurement also disturbs the 
system that is under observation, but the amount of the disturbance can 
be calculated and taken into account. Thus the complementarity prin- 
ciple typifies the fundamental limitations on the classical concept that 
the behavior of atomic systems can be described independently of the 
means by which they are observed. 


LIMITATIONS ON EXPERIMENT 


In the atomic field, we must choose between various experimental arrange- 
ments, each designed to measure the two members of a pair of canonical 
variables with different degrees of precision that are compatible with the 
uncertainty relations. In particular, there are two extreme arrange- 
ments, each of which measures one member of the pair with great pre- 
cision. According to classical theory, these extreme experimental 
arrangements complement each other; the results of both may be obtained 
at once and are necessary to supply a complete classical description of the 
system. In actuality, however, the extreme complementary experiments . 
are mutually exclusive and cannot be performed together. . 

It is in this sense that the classical concept of causality disappears in 
the atomic field. There is causality insofar as the quantum laws that 
describe the behavior of atoms are perfectly definite; there is not, however, 
a causal relationship between successive configurations of an atomic sys- 
tem when we attempt to describe these configurations in classical terms. 


ACIDISCUSSION OF MEASUREMENT 


In this section we consider three fairly typical measurement experiments 
from the point of view of the new quantum mechanics. The first two 
are designed to determine the position and momentum of a particle by 
optical methods; the third is the diffraction experiment in Sec. 2. 


LOCALIZATION EXPERIMENT 


We consider a particular example of the validity of the uncertainty prin- 
ciple, making use of a position-momentum determination that is typical 


10 y QUANTUM MECHANICS 


aS 
fans 
oN 
/ \ 
/ \ 
/ \ 
/ \ 
/ \ 
rs \ 
/ \ 
: eae} 
\ é 
\ é 
ne at \ : 
x \ 4 = e * 
\ / Fig. 2. An experiment for the localiza~- 
eres 5s \ tion of a particle P by means of one of the 
Q-—> yf scattered quanta Q, which is focused by 
ee P the lens L to form an image on the screen S. 


of a number of somewhat similar experiments that have been discussed 
in connection with measurements on particles and radiation fields.! We 
shall consider here the accuracy with which the x components of the 
position and momentum vectors of a material particle can be determined 
at the same time by observing the particle through a rather idealized 
microscope by means of scattered light. 

The best resolving power of the lens L shown in Fig. 2 is known 
(either experimentally or from the theory of wave optics) to provide an 
accuracy 


An ~ 


sin ¢ (4.1) 
in a position determination, where \ is the wavelength of the radiation 
that enters the lens, and « is the half angle subtended at the particle P 
by the lens. For simplicity, we consider the case in which only one of 
the light quanta Q is scattered onto the screen S. Because of the finite 
aperture of the lens, the precise direction in which the photon is scattered 
into the lens is not known. Then since Eq. (1.2) states that the momen- 
tum of the photon after it is scattered is h/d,? the uncertainty in the z 
component of its momentum is approximately (h/d) sin e. 

The z components of the momenta of the photon and the particle 
can be accurately known before the scattering takes place, since there is 
no need then to know the x components of their positions. Also, if our 
position measurement refers to the displacement of the particle with 
respect to the microscope, there is no reason why the total momentum of 
the system (particle, photon, and microscope) need be altered during the 
scattering. Then the uncertainty Ap, in the x component of the momen- 
1 See, for example, W. Heisenberg, ‘The Physical Principles of the Quantum Theory,” 


chaps. II, III (University of Chicago Press, Chicago, 1930); D. Bohm, ee 
Theory,” chap. 5 (Prentice-Hall, Englewood Cliffs, N.J., 1951). 


2 See footnote 2, page 3. 
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tum of the particle after the scattering is equal to the corresponding 
uncertainty for the photon. 


Ap, ™~ i sin € (4.2) 


If we combine Eq. (4.1) with Eq. (4.2), we see that just after the scatter- 
ing process 


Az: Apz~h (4.3) 


is the best that we can do for the particle. Thus a realistic accounting 
of the properties of the radiation gives a result in agreement with the 
uncertainty relation (3.1) for the particle. 

This experiment may also be considered from the point of view of 
the complementarity principle. The complementary arrangements differ 
in the choice of wavelength of the observed radiation: Sufficiently small 
\ permits an accurate determination of the position of the particle just 
after the scattering process, and large of its momentum. 


MOMENTUM DETERMINATION EXPERIMENT 


The experiment just discussed assumes that the momentum of the particle 
is accurately known before the measurement takes place, and then it 
measures the position. It is found that the measurement not only gives 
a somewhat inaccurate position determination but also introduces an 
uncertainty into the momentum. 

We now consider a different experiment in which the position is 
accurately known at the beginning and the momentum is measured. 
We shall see that the measurement not only gives a somewhat inaccurate 
momentum determination but also introduces an uncertainty into the 


position. We assume that the particle is an atom in an excited state, 


which will give off a photon that has the frequency vo if the atom is at 
rest. Because of the doppler effect, motion of the atom toward the 
observer with speed v means that the observed frequency is given approxi- 
mately by 


y= VW G ae ‘) (4.4) 
so that . 


vx (2 2 ) | (4.5) 


Accurate measurement of the momentum mv by measurement of the 
frequency v requires a relatively long time 7; the minimum error in the 
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frequency measurement can be shown to be 
4 . 

The instant at which the photon is emnitied 1S uncertain by 7; at this 
instant, the momentum of the atom decreases by hv/c, and its velocity 


decreases by hy/mc. This makes the subsequent position of the atom 
uncertain by the amount 


Ar =— (4.7) 


since the later the photon is emitted, the longer the atom has the higher 
velocity and the farther it will have traveled. . This position uncertainty 
arises entirely because of the finiteness of r. If were zero, and we knew 
the velocity and the velocity change on emission of the photon, we would 
know where the atom is at each instant; it is because r is finite that we 
do not know when the velocity changed ‘aad hence where the atom is at 
later times. 

The momentum uncertainty is obtained with the Delp of Eqs. (4.5) 
and (4.6): 


mc Av mc 
~/ — 
Vo VoT 


Ap, = mAv = 


8) 


In the nonrelativistic case considered ais, v/e <1,.and Eq. (4.4) shows 
that v ~ vp. Then combination of Eqs. (4.7) and (4.8) leads to the 
minimum uncertainty relation (3.1). 


ANALYSIS OF DIFFRACTION EXPERIMENT 


Finally, we analyze the diffraction experiment of Sec. 2 from the point of 
view of the complementarity principle, assuming that the uncertainty 
principle is valid. Two contrasting arrangements, which would com- 
plement each other classically, are considered here. One of these is | 
illustrated in Fig. 1. Since it is assumed that the distance from A to B 
is large compared with the distance between the two slits, and this in turn 
is large compared with the wavelength of the light, the distribution of 
intensity in the diffraction pattern determines to good approximation 
the angular distribution of the photons leaving the slits in A and hence 
. determines the distribution of the y components of momentum of the 
photons beyond A. The second arrangement, shown in Fig. 3, deter- 
mines through which of the two slits each photon passes and hence 
provides information on the y coordinates of the photons. 

In the second arrangement each photon registers itself as it passes 
through a slit by bouncing off one of a number of indicators C placed 
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Fig.3 The experimental arrangement of Fig. 1, modified by the addition 
of photon indicators C. 


close to A, giving up to it a y component of momentum that may be 
uncertain by the amount Ap,. If we do not want the resultant diffraction 
pattern of many such photons to be destroyed by these events, the 
uncertainty in p, for a particular photon produced by its encounter with 
an indicator must be substantially smaller than would be required to 
throw the photon from a maximum of the diffraction pattern at B into a 
neighboring minimum. With a photon of momentum pz, this requires 
that 

Ap, K Opz (4.9) 


For the simple case in which R >a>>)\, the angle 6 is known experi- 
mentally (or from the theory of wave optics) to be given by 


rn 
0-5 4,10) 
- in terms of the optical wavelength \ and the distance a between the slits. 
At the same time, we have not learned through which slit this photon 
passed unless the uncertainty Ay of the y position of the indicator that 
recoiled is less than half the distance between slits. 


Ay < $a (4.11) 


It then follows from Eqs. (4.9) to (4.11) and (1.2) that the require- 
ment that we be able to determine through which slit each photon passes 
without destroying the diffraction pattern at B is equivalent to the 
requirement that ; 


Ay « Apy K.4h (4.12) 
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for each indicator that is used. Since Eq. (4.12) is in disagreement with 
the uncertainty relation (3.1), we may conclude that it is impossible to 
determine through which slits the photons pass without destroying the 
diffraction pattern. 


DISCUSSION OF DIFFRACTION EXPERIMENT 


The situation just analyzed shows the intimate connection between the 
theoretical principles of uncertainty and complementarity and the experi- 
mental observations related to localization and diffraction. It provides 
an explicit demonstration of the validity of the complementarity principle 
(represented in this case by the choice between the mutually exclusive 
but classically complementary experiments for observing the diffraction 
and for localizing the photon) when taken in conjunction with the experi- 
mentally observable properties of matter and radiation. It shows that no 
fundamental difficulty need be encountered with the photon picture so 
long as we do not insist on the degree of detail in describing the situation 
that is entailed by classical concepts. 

It is, of course, still necessary to ascribe unfamiliar properties to the 
photons in order to explain the experimental observations. The fore- 
going discussion does not show how an individual photon can interfere 
with itself to produce the diffraction pattern, nor, on the other hand, does 
it show how an electromagnetic wave can eject photoelectrons from the 
screen. Such demonstrations lie beyond the scope of the qualitative 
discussion of this chapter and require the use of the mathematical for- 
malism of quantum mechanics.!_- However, analysis of the diffraction 
experiment from the point of view of quantum mechanics removes the 
difficulty encountered in Sec. 2; the diffraction pattern disappears when- 
ever a successful attempt is made to determine through which slit each 
photon passes.? 


SCIWAVE PACKETS IN SPACE AND TIME 


The relation (1.2) between momentum and wavelength, which is known 
experimentally to be valid for-both photons and particles, suggests that 
‘ it might be possible to use concentrated bunches of waves to describe 
localized particles of matter and quanta of radiation. To fix eur ideas, 


1 Chapter 14 shows how the theory of the electromagnetic field can be modified to 
include quantum effects. 


? The matters discussed in this section are discussed more fully in the early chapters 
of the books by R. P. Feynman and A. R. Hibbs, “(Quantum Mechanics and Path 
Integrals,”” (McGraw-Hill, New York, 1965) and by R. P. Feynman, R. B. Leighton, 
and M. Sands, ‘‘The Feynman Lectures on Physics,” vol. III (Addison-Wesley, Read- 
ing, Mass., 1965). 
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Fig. 4 Plots of a typical wave packet ¥(x) and its Fourier transform. 


we shall consider a wave amplitude or wave function that depends on the 
space coordinates z, y, z and the time ¢t. This quantity y is assumed to 
have three basic properties. First, it can interfere with itself, so that it 
can account for the results of diffraction experiments. Second, it is large 
in magnitude where the particle or photon is likely to be and small else- 
where. And third, y will be regarded as describing the behavior of a 
single particle or photon, not: the statistical distribution of a number of 
such quanta. This last is an essential requirement in view of the conclu- 
sion of Sec. 2 that a single quantum of matter or radiation interferes 
with itself rather than with other quanta. In this section we shall con- 
fine ourselves to a qualitative discussion of the one-dimensional case, in 
which the wave function y depends only on zx and t, and leave the quan- 
titative development for Chap. 2. 


SPACE PACKETS 


A typical form for a concentrated bunch of waves, which we shall call a 
wave packet, is shown in Fig. 4a, where y(z,é) is plotted against x for a 
particular time ¢. The average wavelength X» and the approximate 
extension Az of the packet are indicated in the diagram. The Fourier 
integral analysis! of y with respect to z is now of interest since it shows 
how y may be built up out of continuous harmonic waves of various 
lengths. This is indicated in Fig. 4b, in which the Fourier transform of y 
is plotted schematically against the propagation number k = 2r/). 


1 See, for example, P. M. Morse and H. Feshbach, “Methods of Theoretical Physics,’’ 
sec. 4.8 (McGraw-Hill, New York, 1953). , 
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It can be shown by standard mathematical methods that 


1 
ae 
A255 


(5.1) 


where Ak is the approximate spread in propagation number associated 
with the packet. If now we correlate wavelength and momentum as in 
Eq. (1.2) we see that the spread Ak corresponds to a momentum spread 


h h . 
Combination of Eq. (5.1) with Eq. (5.2) gives 
Ax-Ap Zh (5.3) 


which agrees with the uncertainty relation (3.1). Thus the uncertainty 
principle for position and momentum of a quantum of matter or radiation 
follows directly from the wave-packet description and Eq. (1.2). 


TIME PACKETS 


In analogous fashion, we may examine the dependence of y on the time ¢ 
for a point x that is typical of the packet, and we obtain a time Fourier 
transform that shows how y can be built up out of continuous harmonic 
waves of various frequencies v. In this case the relation between the 
spread in time of y and the spread in frequency of the time Fourier trans- 
form of y is 


Mvp = (5.4) 
2r 
Equation (5.4) may be brought into correspondence with the uncer- 
tainty principle by associating the energy E of a quantum with the 
frequency of the wave that represents it in a manner similar to the 
association of momentum with wavelength given by Eq. (1.2). We 
shall make this connection through Eq. (1.1): 


E=hv 5.5) 


which may be inferred in the case of photons from the experimental 
discussion of Sec. 1. Combination of Eq. (5.4) with Eq. (5.5) then gives 
the uncertainty relation (3.3). 

The assumption that Eq. (5.5) is valid for matter as well as for radia- 
tion may be made plausible by computing the group velocity! of a wave 
packet that represents a nonrelativistic particle of mass m, kinetic energy 
E, and momentum p for which ) and » are given by Eqs. (1.2) and (5.5), 
respectively. The group velocity, which is the velocity of the center of 


1 Born, op. cit., pp. 88, 330. 
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the packet, is equal to 


dy _dE _d(p*/2m)_ pp | (5.6) 


in agreement with the classical expression for the velocity. This shows 
that, with Eq. (5.5), the wave-packet description of the motion of a 
particle agrees with the classical description when the circumstances are 
such that we can ignore the size and internal structure of the packet. 


WAVE FORMALISM 


We see then that quanta of matter or radiation can be represented in 
agreement with the uncertainty principle by wave packets that may be 
superposed to produce interference and whose magnitudes give a measure 
of likelihood of location, provided that use is made of the experimentally 
inferred relations (1.2) and (5.5). It is then possible to set up a quan- 
titative formalism based on the mathematical analysis of wave motion. 
This will be done for matter in Chap. 2, using the physical principles — 
outlined in this chapter as a guide and requiring always that the result 
of any calculation reduce to the result of the corresponding classical cal- 
culation in the appropriate limit. This requirement is a way of stating 
Bohr’s correspondence principle, which was mentioned in Sec. 2. At the 
present time, when a reasonably complete quantum theory exists, the 
correspondence principle is mainly of interest in assuring that the for- 
malism has the proper classical limit, although it may also be of heuristic 
value in performing new calculations or extending the boundaries of the 
theory. — a 

It might at first be thought that the exclusive use of a wave for- 
malism for the description of matter in the next four chapters would 
conflict with the observed particle-wave duality discussed in Sec. 1 and 
hence disagree with the complementarity principle. This is not the case, 
however, for the formalism actually provides an understanding of all the 
measurable properties of matter, including, for example, the production of 
particle tracks in a cloud chamber. Thus it will be shown in Sec. 38 
that, if a single material particle is represented by a wave function of 
definite momentum and hence completely undetermined position, the 
probability of ionization of two or more molecules of the cloud-chamber 
gas is negligibly small unless the molecules lie very nearly along a line 
parallel to the momentum vector. 

It must be emphasized that these remarks are true only if a wave 
function of the type described in this section is always interpreted as 
representing just one particle of matter and not the statistical distribu- 
tion of a number of particles. When we wish to describe more than one 
particle, we must make use of a wave function that depends on the coor- 
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dinates of all of them. The analogous analytical description of light 
quanta, which will be taken up quantitatively in Chap. 14, makes use 
of a somewhat different approach. This is mainly because photons 
(unlike particles as we consider them here) can be emitted or absorbed 
through interaction with matter, so that their number is not fixed. Thus 
a photon wave function would have to depend on a variable number of 
parameters, and it is desirable to avoid such a situation. 


PROBLEMS 


1. Give a brief description of each of the experiments referred to in Table 1, together 
with their interpretations. 


2. Describe briefly the Davisson-Germer and Thomson experiments and the use of 
the Wilson cloud chamber for the observation of particle tracks. 

3. <A beam of silver atoms for a Stern-Gerlach experiment is produced by collimat- 
ing atoms that vaporize from silver held in a furnace at 1200°C. If the beam travels 
1 meter, use the uncertainty relation to find the order of magnitude of the smallest 
spot that can be obtained at the detector. . 


4. A 1-ounce rifle bullet takes 0.5 sec to reach its target. Regarding the bullet as a 
mass point, and neglecting effects of air resistance and earth motion, find the order of 
magnitude of the spread of successive shots at the target under optimum conditions 
of aiming and firing. 

5. A perfectly elastic ping-pong ball is dropped in vacuum from a height equal to 
ten times its radius onto a perfectly elastic fixed sphere of the same radius. Neglecting 
effects due to earth motion, estimate the largest number of bounces against the fixed 
sphere that the ball can be expected to make under optimum conditions of release. 


6. A beam of monoenergetic electrons is used to excite a particular level of an atom 
in a Franck-Hertz experiment. If this level is of short duration, owing to radiation 
back to the ground state, show that the inelastically scattered electrons that have lost 
energy to produce the excited level will not all be expected to have the same final 
energy. If the excited level lasts about 10-!° sec, what is the order.of magnitude of 
the electron energy spread, measured in electron-volts? 


7. Discuss any connections you can think of that exist between the three uncertainty 
relations (3.1), (3.2), and (3.3). 

8. Derive the expression for the group velocity that is given as the left side of 
Eq. (5.6). 


2 
The Schrddinger 
Wave Equation 


This and the next several chapters are concerned with the nonrelativistic 
motion of a particle in a force field that can be represented by a potential 
energy.1 A quantitative description of the motion in terms of a dif- 
ferential equation, the Schrédinger wave equation, is developed in this 
chapter and applied to a simple one-dimensional problem. Various 
assumptions have to be made as regards the structure of the wave equa- 
tion, the boundary and continuity conditions on its solutions, and the 
physical meaning of these solutions. These assumptions are given a high 
degree of plausibility in this chapter and the next by relating them to 
experimental results, mainly those that deal with the diffraction of 
material particles and with the existence of’a classical limit to the quan- 
tum mechanics. However, no attempt is made to derive the formalism 
uniquely from a consideration of the experiments. The definitive test 


1 This is an idealization of the true situation, in which special relativity must be taken 
into account, particles can be created and destroyed, and interactions cannot be 
described by potentials. Nevertheless, it is extraordinarily useful in . dealing with 
atoms, molecules, condensed matter, and atomic nuclei. 
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of the theory must, of course, be its internal consistency and the success 
with which deductions from it agree with particular experimental, meas- 
urements; some examples will be worked out in Chaps. 4 and 5. 


6LIDEVELOPMENT OF THE WAVE EQUATION . 


The form of the Schrédinger wave equation is obtained in this section 
by generalizing the properties of the wave amplitude of Sec. 5. The 
remainder of this chapter presents a discussion of some of the properties 
of the equation and its solutions. 


TRAVELING HARMONIC WAVES 


The first task is to develop in more quantitative fashion the properties 
of the one-dimensional wave function y(z,t) that was discussed qualita- 
tively in Sec. 5. It was shown there that for a continuous traveling 
harmonic wave, the wavelength and momentum are related by Eq. (1.2) 
and the energy and frequency by Eq. (5.5). We rewrite these two equa- 
tions in terms of the universal constant h = h/27: 


p = hk k= “ (6.1) 
E =hw w = Qarp (6.2) 


A wave function y(z,t) that represents a particle of completely 
undetermined position traveling in the positive z direction with precisely 
known momentum p and kinetic energy E would then be expected to 
have one of the forms’ 


cos (kx — wt) sin (kx — wt) et(kx—wt) e—tlke—wt) — (6,3) 


or some linear combination of them. This follows from diffraction experi- 
ments like those of Davisson and Germer and of Thomson (see Sec. 1) 
and from the requirement that a wave packet of approximately the 
propagation number k and angular frequency w has a group velocity equal 
to that of a classical free particle of momentum p and energy EF [see 
Eq. (5.6)]. 


NEED FOR A WAVE EQUATION 


In order to go beyond the very simplest problem of a continuous harmonic 
wave, it is very desirable to have an equation of which both the harmonic 
waves and more complicated waves are solutions. An example from a 
more familiar field of physics should help to clarify this point. In the 
case of three-dimensional sound waves in a gas, it is possible to obtain a 
solution of the problem of the scattering of sound by a rigid sphere by 
superposing plane harmonic waves traveling in various directions. But 
it is far simpler to solve the differential equation for sound waves directly 
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in spherical polar coordinates. If the temperature of the gas changes 
from point to point, no progress can be made in the general case without 
such a differential equation. The correct underlying equation for sound 
waves can be found from direct consideration of the mechanical properties 
of the gas. Although this is not the case with the equation of which the 
wave functions of Sec. 5 are solutions, it is no less imperative to find the 
form of the equation. The need for this becomes more evident when the 
wave function is to describe the motion of a particle under the influence 
of external forces; this situation turns out to be analogous to the propaga- 
tion of sound waves in an inhomogeneous gas. We shall, therefore, find 
an equation for y and, having found it, shall regard it as a more funda- 
mental attribute of the wave functions than the harmonic forms (6.3). 

The equation must have two basic properties. First, it must be 
linear, in order that solutions of it can be superposed to produce inter- 
ference effects (in the three-dimensional case) and to permit the con- 
struction of wave packets. And, second, the coefficients of the equation 
must involve only constants such as f and the mass and charge of the 
particle, and not the parameters of a particular kind of motion of the 
particle (momentum, energy, propagation number, and frequency). The 
reason for the latter requirement is that we shall want to leave open the 
possibility of superposing solutions that belong to different values of 
these parameters; this means that such a more general wave function 
cannot be a solution of an equation that involves the parameters in its 
structure. Since differential equations are the easiest to handle, it is 
worthwhile to try this type first, and it turns out that the requirements 
can be met by a differential equation. 

With all these considerations in mind, we look first at the most 
familiar one-dimensional wave equation, that which describes the motion 
of transverse waves on a string or plane sound waves in a gas: 


ory ary 
oe ~ 1 oz? | (6.4) 


where vy is the square of the wave velocity. Substitution of the forms 
(6.3) into Eq. (6.4) shows that each of the four harmonic solutions, and 
hence any linear combination of them, satisfies this differential equation 
if and only if we put 


vas ot at (6.5) 


where m is the mass of the particle that is to be described by Eq. (6.4). 
Because of the structure of Eq. (6.5) it is apparent that the coefficient y 
that appears in Eq. (6.4) involves the parameters of the motion (E or p); 
we therefore discard this differential equation. 
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THE ONE-DIMENSIONAL WAVE EQUATION 


In looking further for a suitable equation, it is helpful to note that 
differentiation with respect to x of wave functions like those of (6.3) 
has the general effect of multiplication of the function by k (and some- 
times also interchanging sine and cosine), whereas differentiation with 
respect to ¢ has the general effect of multiplication by w. Then the rela- 
tion E = p?/2m, which is equivalent to the relation w = hk?/2m, suggests 
that the differential equation for which we are looking contains a first 
derivative with respect to ¢ and a second derivative with respect to z. 


oy soap 

Ot OY: ax? (6.6) 

Substitution shows that the first two of the wave functions (6.3) are not 

solutions of Eq. (6.6) but that either of the last two may be (but not both 

at once) if the constant y is suitably chosen. In particular, if we choose 
iw thE th 

1" 7 pt Bm (6.7) 
then the third of the wave functions (6.3) satisfies Eq. (6.6). Moreover, 
the value of y given by Eq. (6.7) involves only the constants & and m. 

We are thus led to the one-dimensional form of the Schrédinger! 
wave equation for a free particle of mass m, which from Eqs. (6.6) and 
(6.7) may be written 

4 2 
7 grea lg (6.8) 


The particular form in which Eq. (6.8) is written is significant insofar 
as its harmonic solution, the third of the wave functions (6.3), makes 
the left side Ey and the right side (p?/2m)y. The fact that the solution 
e'(kz—-w!) ig complex is not in itself a defect of the formalism. We shall 
have to be certain that all predicted results of possible physical observa- 
tions are expressible in terms of real numbers, and this will supply a 
condition on the detailed interpretation of y. 


EXTENSION TO THREE DIMENSIONS 


The foregoing one-dimensional treatment is readily extended to three 
dimensions. It is natural to rewrite Eq. (6.1) as 


p = Ak ee = (6.9) 


1K. Schrédinger, Ann. Physik 79, 361, 489 (1926); 81, 109 (1926). The present 
treatment is somewhat different from that originally given by Schrédinger. 
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where k is called the propagation vector. Similarly, the third of the wave 
functions (6.3) becomes 


exp 7(k+-r — wf) (6.10) 


where r is the position vector for the particle. Then, by an obvious 
extension of the argument that led up to Eq. (6.8), it is seen that the 
three-dimensional Schrédinger equation for a free particle that is repre- 
sented by the wave function y(r,t) is 

ay _ 


—/ vy (6.11) 


hap — Om 


A comparison of Eqs. (6.9) to (6.11) and the classical energy equation 
2 
E=j2 (6.12) 


suggests that, at least for a free particle, the energy and momentum can 
be represented by differential operators that act on the wave function y. 


E= ins p— —ihV (6.13) 


The development of Secs. 7, 8, 10, and 11 will show that these are also 
valid representations when the particle is not free. 


INCLUSION OF FORCES 


The next problem is to extend the free-particle wave equation (6.11) so 
that it includes the effects of external forces that may act on the particle. 
We shall assume for the present that these forces are of such a nature 
(electrostatic, gravitational, possibly nuclear) that they can be combined 
into a single force F that is derivable from a real potential energy V. 


F(r,t) = —VV(r,t) (6.14) 


Just as the classical relation between energy and momentum is used above 
to infer the structure of Eq. (6.11), so it is desirable now to start from the 
corresponding classical relation that includes external forces. This is 
simply expressed in terms of the potential energy 


ge ae 
E= Sai + Vor,t) (6.15) 
where E is now the total energy, and the first and second terms on the 


right side of Eq. (6.15) are the kinetic and potential energies of the 
particle, respectively. 
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Since V does not depend on p or E, Eqs. (6.15) and (6.13) suggest 
that Eq. (6.11) be generalized into 


fh sad 2p 2 ue vy + Virtv (6.16) 


This is the Schrédinger wave equation that describes the motion of a 
particle of mass m in a force field given by Eq. (6.14).! Although the 
introduction of Eq. (6.16) cannot claim as high a degree of plausibility 
as the derivation of the free-particle equation (6.11), the further discus- 
sion of the next section should make it more convincing. It is, of course, 
the agreement of solutions of Eq. (6.16) with experiment in particular 
cases that eventually demonstrates the validity and usefulness of this 
wave equation. 


7LIINTERPRETATION OF THE WAVE FUNCTION 


The wave function (r,t), which is a solution of the wave equation (6.16), 
is now assumed to provide a quantum-mechanically complete description 
of the behavior of a particle of mass m with the potential energy V (r,t) 
and hence is analogous to the classical trajectory r(é). Thus far, the 
only interpretative guide available to us is that the wave function be 
large where the particle is likely to be and small elsewhere. This has to 
be supplemented with more detailed statements that enable us to get out 
of y the maximum amount of information permitted by nature, as was 
discussed in Sec. 3. As with the structure of the wave equation, the 
correctness of our interpretation of the wave function must be judged by 
logical consistency and appeal to experimental results. 


STATISTICAL INTERPRETATION 


The phrase “likely to be” in the preceding paragraph, together with the 
discussion of Sec. 3, indicates the need for interpreting y in statistical 
terms. We can imagine a very large number of identical, independent, 
nonoverlapping regions of space, each large enough to contain all the 
physically interesting features of the motion, in each of which the behavior 
of a particle with the potential energy V (r,t) is described by the same wave 
function y(r,t); in each case r is referred to the origin of the particular 
region. Or we can imagine a very large number of independent repeti- 
tions of the same motion in the same region of space, for each of which ¢ 
is referred to the particular origin of the time. We then make the 


1 The development of the wavefunction in time can also be related to integrals over 
all possible paths of the particle; see R. P. Feynman, Rev. Mod. Phys. 20, 367 (1948), 
and the book by R. P. Feynman and A. R. Hibbs, ‘(Quantum Mechanics and Path 
Integrals’? (McGraw-Hill, New York, 1965). 
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assumption, due to Born,! that the numerical result of the measurement 
at a particular time ¢ (insofar as the time at which the measurement is 
made can be specified) of any physically meaningful quantity, such as 
position, momentum, or energy, will in general not be the same for all the 
regions. Rather, there will be a distribution of these numbers that can 
be described by a probability function. 

For example, we have seen in Sec. 5 that the result of a position 
determination is to be regarded as uncertain by an amount of the order 
of the linear dimensions of the wave function. It is natural therefore 
to regard y as a measure of the probability of finding a particle at a par- 
ticular position with respect to the origin of its region. However, a 
probability must be real and nonnegative, whereas ¥ is complex in general. 
We therefore assume that the product of y and its complex conjugate y* 
is the position probability density. 


P(t) = ¥*(r)¥(rd) = Wed)? ) (71) 


This means that P(r,t) dx dy dz is to be the probability of finding a particle 
in its volume element dz dy dz about its point r at the time ¢, when a large 
number of precise position measurements are made on independent par- 
ticles each of which is described by the one-particle wave function (r,t). 


NORMALIZATION OF y 


The probability of finding the particle somewhere in the region must be 
unity, so that Eq. (7.1) implies that the wave function is normalized: 


Sly @,t) |? dr = 1 | (7.2) 


where the integral extends over the entire region; here d*r is the three- 
dimensional volume element dz dy dz. If p is a wave packet of the type 
discussed in Sec. 5, the integral in Eq. (7.2) converges, and the numerical 
coefficient of y may be adjusted so that the integral is unity; such nor- 
malization does not change the fact that y is a solution of Eq. (6.16), 
‘which is homogeneous in ¥. There are, however, wave functions like 
that given in Eq. (6.10) for which the integral in Eq. (7.2) does not con- 
verge if taken over an infinite volume. Such wave functions require 
special consideration and will be discussed further in Secs. 10 and 11. 
For the present, we may think of the region of space in which such a 
wave function is defined as being arbitrarily large but finite; then the 
integral in Eq. (7.2) is over the finite volume of this region and converges, 
so that normalization is always possible. 

The coefficient of y that normalizes it must be independent of the 
time-in order that y may satisfy the wave equation (6.16). Thus, if 


1M. Born, Z. Physik 87, 863 (1926); Nature 119, 354 (1927). 
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Kq. (7.2) is satisfied at one instant of time, the interpretation of |p|? as a 
position probability density requires that the normalization integral be 
independent of the time. That this is actually the case may be shown 
by computing the time derivative of the integral of P over any fixed 
volume 2: 


oO By * oy* 3 
gi fn PO!) Or = fi (v3 Ot | Ot v) ax 
| = 2 fh wavy — (wy) dr 
= oF a V+ [vtvy — (wy*)¥] ar 
= 5 [Wty — (wh, dA 
Here substitution has been made for dy/dt from Eq. (6.16) and for ay*/ gt 
from the complex conjugate of Eq. (6.16). The last integral is obtained 
by partial integration (use of Green’s theorem), where A is the bounding 
surface of the region of integration and [ ], denotes the component of 
the vector in brackets in the earectign, of the outward normal to the 


surface element dA.! 
We define a vector S(r,t), 


Sirf) = s2 Ww — (wv*)¥) (7.3) 
in terms of which 
< [Pad Pies & [v-sax Bue i S, dA (7.4). 


In the case of a wave packet, for which y vanishes at great distances and 
the normalization integral converges, the surface integral is evidently 
zero when @ is the entire space. For a wave function of thé type given 
in Eq. (6.10), y can be defined in a finite region Q so that it vanishes or 
has a periodic structure along the bounding surfaces (see Sec. 10). In all 
these cases, it can be shown without difficulty that the surface integral in 
Kq. (7.4) is zero, so that the normalization integral in Eq. (7.2) is constant 
in time. 


"PROBABILITY CURRENT DENSITY 
The derivation of Eq. (7.4) also shows that the differential relation 


oP) 4 y- Sel) = 0 


1 It is convenient to adopt an order of factors such that y* precedes y [see the discussion 
of Eq. (7.7) below]. 
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is valid. This has the familiar form associated with the conservation of 
flow of a fluid of density P and current density S, in which there are no 
sources or sinks. It is thus reasonable to interpret S(r,t) given by Eq. 
(7.3) as a probability current density. This interpretation makes more 
plausible the identification of —2hV with the momentum in Eq. (6.13), 
even when a force is present. Then (h/im)V is the velocity operator, 
and it is apparent that 


h 
—_ * 
S(r,t) = real part of (v Fa vv) 


Although this interpretation of S is suggestive, it must be realized 
that S is not susceptible to direct measurement in the sense in which P is. 
It would be misleading, for example, to say that S(r,t) is the average 
measured particle flux at the point r and the time t, because a measure- 
ment of average local flux implies simultaneous high-precision measure- 
ments of position and velocity (which is equivalent to momentum) and is 
therefore inconsistent with the uncertainty relation (3.1). Nevertheless, 
it is sometimes helpful to think of S as a flux vector, especially when it 
depends only slightly or not at all on r, so that an accurate velocity 
determination can be made without impairing the usefulness of the con- 
cept of flux. 


EXPECTATION VALUE 


The existence of the position probability density P(r,t) makes it possible 
to calculate what we shall call the expectation value of the position vector 
of a particle, which is defined as the vector whose components are the 
weighted averages of the corresponding components of the position of the 
particle. The expectation value is the mathematical expectation (in the 
sense of probability theory) for the result of a single measurement, or 
it is the average of the results of a large number of measurements on 
independent systems of the type discussed at the beginning of this sec- 
tion. We write the expectation value of r as 


r) = frPQ,t) dr = fy*(x,t)ry@,t) d*r (7.5) 
which is equivalent to the three equations 
(x) = Jy*ay d*r (y) = Je*yp d*r (2) = Jy*ep d*r 


where y is normalized. The expectation value is a function only of the 
time, since y and P depend on ¢ and the space coordinates have been 
integrated out. 

The expectation values of any other physically meaningful quan- 
tities can be found in a similar way if they are functions only of the 
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particle coordinate r. Thus the expectation value of the potential 
energy is 


(V) = JVGOPG) dr = fH aQV GOVE) dr (7.6) 


A quantity such as momentum or energy must, however, be expressed 
in terms of r and ¢ before a calculation of this type can be made. We 
assume that it is possible to use the differential-operator representations 
given in Eq. (6.13) for this purpose and will justify this assumption with 
the help of the corresponding probability functions in Sec. 10 (for the 
energy) and in Sec. 11 (for the momentum). The question immediately 
arises, however, as to how such differential operators are to be combined 
with the position probability density P. ; . 

This question may be answered by imposing on the expectation 
values the reasonable requirement that : 


(ey = (BY 4 (v) 


in analogy with the classical energy equation (6.15). In terms of dif- 


ferential operators, this may be written 


(in <\ = (-E wim (7.7) 


It is apparent that Eq. (7.7) is consistent with the wave equation (6.16) 
only if the expectation value is defined in the general case with the opera- 
tor acting on y and multiplied on the left by y*. We therefore obtain, 
for example, 


(Hy = f prin ary () = j v*(—ah) Vy dr (7.8) 


Like Eq. (7. 5), the second of Eqs. (7. ae is equivalent to the three com- 
ponent equations 


— anf ys har = aa f ye OF 
(p.) = —iafyrXdr = @,) = a fee 
= an [ys hag 
(p.) = —in f wd 


EHRENFEST’S THEOREM! 


It is reasonable to expect the motion of a wave packet to agree with the 
motion of the corresponding classical particle whenever the potential 
energy changes by a negligible amount over the dimensions of the packet. 


1 P. Ehrenfest, Z. Physik 45, 455 (1927). 
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If we mean by the “position” and “momentum” vectors of the packet the 
weighted averages or expectation values of these quantities, we can show 
that the classical and quantum motions always agree. A component of 
the “velocity” of the packet will be the time rate of change of the expecta- 
tion value of that component of the position; since (x) depends only on 
the time and the 2 in the integrand of Eq. (7.5) is a variable of integration, 
this is 


= * Sin = + 3 3 
£ (a) = & f vine dtr [ves M ar + [ OMT ab dr 
This may be simplified by substituting for the time derivatives of the 


wave function and its complex conjugate from Eq. (6.16) and canceling 
the V terms, where we continue to assume for the present that V is real: 


£ (a) —_ als w*x (- tv + vv) dr 
— f (- 2 yy + vv") ay a*r| 
= 2 f tyercvy) — (vv) oH ar 
The second integral can be integrated by parts: 
f wway ar = — f (we) - vey) ar + f[, VI) dA 
where the integral of the normal component of z¥Vy* over the infinite 
bounding surface A is zero because a wave packet y vanishes at great 


distances. A second partial integration, in which the surface integral 
again vanishes, results in 


Say dir = fy*v?(ap) dr 
Thus 


£ (ay = 2 f vtlevy — v%ey)] a 


_ _ th ed eS! | 
= ey vt dr = — (p.) (7.9) 


Since (x) is seen always to be a real number from the structure of Eq. (7.5), 
Eq. (7.9) shows quite incidentally that (pz) is real; this can also be shown 
from the second of Eqs. (7.8) when y represents a wave packet, by means 
of a partial integration. 
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In similar fashion we can calculate the time rate of change of a com- 
ponent of the “momentum” of the particle as 


ere are 
qs) = ~ ih [ or Xd 
—an (fH ar+ (2% a) 


+ 0 2. 3 
-fy¥ =(- mre ty) ee 


+f (-3 Ti VW + Vor) d*r 


* a — oy 3 
-[v [go -vi]o 


OV,  / aV 
—fy ge Vor = ( =) (7.10) 


li 


again substituting from the wave equation and integrating twice by parts. 

Equations (7.9) and (7.10), together with their other components, 
constitute Ehrenfest’s theorem. They are analogous to the classical equa- 
tions of motion: 


Ehrenfest’s theorem provides an example of the correspondence principle, 
since it shows that a wave packet moves like a classical particle whenever 
the expectation value gives a good representation of the classical variable; 
this is usually the macroscopic limit in which the finite size and the 
internal structure of the packet can be ignored. 


8LIENERGY EIGENFUNCTIONS 


The Schrédinger wave equation (6.16) admits of considerable simplifica- 
tion when the potential energy V(r) does not depend on the time. It is 
then possible to express its general solution as a sum of products of 
functions of r and ¢ separately. 


SEPARATION OF THE WAVE EQUATION 


We consider a particular solution of Eq. (6.16) that can be written as a 
product: y(r,t) = u(r) f(t). A general solution can then be written as a 
sum of such separated solutions. If we substitute into Eq. (6.16) and 
divide through by the product, we obtain 


i! ee | - a Vu + van | (8.1) 
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Since the left side of Eq. (8.1) depends only on ¢ and the right side only 
on r, both sides must be equal to the same separation constant, which we 
call FE. Then the equation for f is readily integrated to give 


t@® = Ce-*Etl A 


where C is an arbitrary constant, and the equation for u becomes 


2 

| - ae v2 + ve) | u(r) = Eu(r) (8:2) 
2m 

Since Eq. (8.2) is homogeneous in u, the constant C may be chosen to 

normalize wu. Then a particular solution of the wave equation is 


¥(r,t) = u(r)e Bee (8.3) 


SIGNIFICANCE OF THE SEPARATION CONSTANT £ 


The time-derivative operator given in Eq. (6.13) as a representation of the 
total energy may be applied to the y of Eq. (8.3) to give 


. OV 
th >, 


= Hy (8.4) 
An equation of the type of Eq. (8.4) is called an eigenvalue equation; y 
is said to be an eigenfunction of the operator that appears on the left, 
and the multiplying constant E that appears on the right is called the 
corresponding eigenvalue.! An energy eigenfunction, like the ¥ in Eq. 
(8.3), is said to represent a stationary state of the particle, since |p|? is 
constant in time. 

Equation (8.2) is also an eigenvalue equation. It states that u (and 
hence also ¥) is an eigenfunction of the operator — (A?/2m)V? + V(r) 
with the same eigenvalue E. It is, of course, to be expected that y is 
an eigenfunction of this operator if it is an eigenfunction of the time- 
derivative operator since, according to the wave equation (6.16), the two 
operators are equivalent not only for separated functions of the form of 
Eq. (8.3) but also for more general solutions. 

We now anticipate the discussion of the physical significance of 
eigenfunctions and eigenvalues that will be presented in Chap. 3 and 
assume that the energy eigenvalues E are the only possible results of 
precise measurements of the total energy of the particle. It is then of 
interest to inquire whether or not physically interesting solutions u(r) 
of Eq. (8.2) exist for all real values of E. An answer cannot be obtained 
until a specification of “‘physical interest”’ is found in terms of the bound- 
ary conditions that are imposed on u(r). This specification and the 


1 The terms characteristic function and characteristic value are often used in place of 
eigenfunction and eigenvalue. 
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general character of the energy eigenvalues that are associated with 
various types of potential energy function V(r) are considered in the 
remainder of this section. 


BOUNDARY CONDITIONS AT GREAT DISTANCES 


We have thus far encountered two classes of wave functions: wave 
packets that are well localized and for which the normalization integral 
Siw? dr converges, and traveling harmonic waves like the function (6.10) 
that have a constant magnitude at great distances so that the normaliza- 
tion integral taken over an infinite volume diverges. . The first class may 
be interpreted as representing particles that, if free, are initially well 
localized or that are restrained to a particular region of space by external 
forces derived from the potential energy V(r). The second class repre- 
sents particles that are neither localized nor restrained but travel through 
the region under consideration from one distant part of space to another; 
such wave functions will be useful in describing the scattering of particles 
by a field of force.! In either case, the wave functions are bounded at 
great distances in all directions. 


CONTINUITY CONDITIONS 


The time-independent wave equation (8.2) is a second-order linear dif- 
ferential equation inr. Thus so long as V(r) is finite, whether or not it is 
continuous, a knowledge of the wave function and its gradient along a 
surface makes it possible to integrate the equation to obtain the wave 
function at any point. It is reasonable, therefore, to require that the 
wave function and its gradient be continuous, finite, and single-valued 
at every point in space, in order that a definite physical situation can be 
represented uniquely by a wave function. These requirements also have 
the consequence that the position probability density P(r) and the prob- 
ability current density S(r), defined in Sec. 7, are finite and continuous 
everywhere. 


BOUNDARY CONDITIONS FOR INFINITE POTENTIAL ENERGY 


If V(r) is infinite anywhere, the appropriate boundary condition can be 
established by a limiting process that starts from a finite V and the 
above continuity conditions. 

Suppose, for example, that there is an infinite discontinuity in V 
across a continuous surface, so that the potential energy is finite on one 
side of it and + © on the other, and we wish to determine the boundary 


1 Another possible class consists of wave functions that become infinite at large dis- 
tances; however, these are not of physical interest, since we have no reason to be 
concerned with particles for which the position probability density becomes indefi- 
nitely large in remote regions of space. 
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conditions on u(r) and Vu at this surface. The essential features of 
the problem are retained if we replace the continuous surface by the 
plane that is tangent to it at the point of interest and the continuously 
changing potential energy on one side of the surface by a constant poten- 
tial, which can, without loss of generality, be chosen to be zero since any 
constant change in V is equivalent to an equal change in E. We choose 
the origin of coordinates at the point of interest and the x axis perpendic- 
ular to the tangent plane. 

The wave equation (8.2) then separates in the three space coordi- 
nates, and the dependence of u on y and z is not affected by the dis- 
continuity in V at the plane z = 0. We wish therefore to solve the 
one-dimensional wave equation 

h? du 

~ 2m dz? 
where V(x) = 0 for « < 0, V(x) = Vo for x > 0, and we eventually 
pass to the limit Vo —> +. Ifwe assume that0 < E < Vo, the general 
solutions of Eq. (8.5) are 


+ V(z)u = Eu | (8.5) 


UC) = + ( 3 

( ) A sin ax B cos at 2<0O . one 

evs — 2 Vo — ; 
(x) Ce-6= + Deft a>od p= 4 | m{ es 2) 


The boundary condition that u be bounded at great distances requires 
that we set D = 0. Then the continuity of u at x = 0 gives the relation 
B=C, and the continuity of du/dz gives the relation aA = —8C. 
Since 8 becomes infinite when Vo does, and the solution for zx < 0 must 
be finite, the second relation shows that C becomes zero as Vo— ©, 
thus also making B zero; A is not determined from these relations but 
might be fixed by opma lization: 

Thus the boundary conditions at a surface at which there is an infi- 
nite potential step are that the wave function is zero and the component 
of the gradient of the wave function normal to the surface is not deter- 
mined. The assumption above that HE < Vois evidently not a restriction 
since Vo eventually becomes infinite. For E < 0, the sine and cosine 
in the solution for z < 0 are replaced by hyperbolic sine and cosine (which 
- is permissible since the solution need hold only near x = 0), with no 
change in the final result. It should be noted that both P and S, vanish 
as x —> 0 from the negative side, so that they are continuous at z = 0 
even though du/dz is not. 

A boundary surface of this type represents a perfectly rigid, impene- 
trable wall, since in the analogous classical situation a particle of any 
finite energy would have its x component of momentum reversed instan- 
taneously on contact with the surface. 
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ENERGY EIGENVALUES IN ONE DIMENSION 


Energy eigenfunctions that represent particles that are restrained to a 
particular region of space by the potential energy (first class) are always 
characterized by discrete eigenvalues, whereas eigenfunctions that do not 
vanish at great distances (second class) possess a continuous range of 
eigenvalues. This may be seen qualitatively by considering the nature 
of the solutions of the one-dimensional wave equation (8.5). 

We shall suppose at first that V(x) becomes equal to some constant 
value, which may be taken to be zero, for sufficiently large positive and 
negative x and that E <0. A classical particle with this total energy E 
cannot escape to infinity and, indeed, can exist in the region only if E is 
greater than or equal to the smallest value Vinin of V(x). The permitted 
form of the wave function for |x| large enough that V = 0 is evidently 
ell, where 8 = +(—2mE/h2)!. These two solutions for large positive 
and negative x can be extended in toward some intermediate point, say 
x = 0, by making use of the wave equation and the continuity conditions. 
At this point, w can always be made continuous by choosing the arbitrary 
multiplying constants for the positive and negative x solutions appropri- 
ately. Then a little reflection shows that, for an arbitrary value of £, 
they will not meet with the same value of du/dz. There may, however, 
be particular values of E for which both u and du/dz are continuous at 
z =0. The conditions for this can be seen in the following way. 

In the regions in which E < V(x), (d?u/dx?)/uw is positive, and so u is 
convex toward the x axis. Thus the two solutions that are continued in 
from :+ © have opposite signs for their ratios of slope to value, (du/dz)/u, 
so long as they are in regions for which E is always less than V. This is 
illustrated in Fig. 5b for the potential shown in Fig. 5a; both choices of the 
sign of u are shown forx <0. The points at which E = V(z) are called 
the turning points (TP) of the classical motion, since they are the limits of 
the motion of a classical particle of energy EZ, at which the particle turns 
around or reverses its motion. At these points d’u/dzx? = 0, and u has 
zero curvature. 

Clearly what is needed to make the two solutions join smoothly is a 
region in which E > V(z), in order that (d?u/dz?) /u may be negative and 
u be concave toward the x axis; this may permit the ratios of slope to value 
to become equal. Figure 5c shows the two solutions carried in until they 
meet, but for a somewhat small value of #, so that when the u’s are made 
the same at x = 0 the slopes are not equal (solid curves), or when the 
slopes are made the same at z = 0 the u’s are not equal (dotted curve on 
left and solid curve on right). Figure 5d shows a somewhat larger (less 
negative) value of EZ, and Fig. 5e a still larger value. The values of E 
and Vin are indicated on the wu axis, and the turning points are indicated 
on the zx axis, in the last three cases. 
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Fig. 5 (a) Potential-energy function and (b) solutions for large |z|; (c) and (e) show 
that either the wave function or its slope is discontinuous at x = 0 for values of & 
that are smaller (more negative) and larger, respectively, than the energy eigenvalue 
shown in (a). 


DISCRETE ENERGY LEVELS 


We see then that an eigenfunction that satisfies the boundary and 
continuity conditions, and that represents a particle bound by the 
potential energy V(x), can exist for the particular value of E illustrated 
in Fig. 5d. In analogy with the classical situation, a necessary condition 
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that such an eigenfunction exist is that Vin < 0, in which case E lies 
between Vin and 0; as in the classical case, this condition is also sufficient 
in one dimension although it is not in three dimensions (see Prob. 10, 
Chap. 4, and Secs. 9 and 15). 

If the potential energy well illustrated in Fig. 5a is sufficiently broad 
or deep, there will exist another eigenfunction corresponding to a larger 
energy eigenvalue that is still negative. Figure 6a, b, and c show a series 
of wave functions analogous to those shown in Fig. 5c, d, and e for suc- 
cessively increasing (successively less negative) values of E; both signs 
of u are shown forz <0. Thus Figs. 5d and 6b show the eigenfunctions 
for the two lowest energy eigenvalues or energy levels of a particle bound 
by the potential well V(x). It is easy to see by an extension of the fore- 
going qualitative arguments that, if there are any higher discrete energy 
levels, each eigenfunction has one more node than that corresponding to 
the next lowest eigenvalue. 


u(x) 


(b) 


(c) 


Fig.6 Solutions for sufficiently broad or deep potential and larger (less negative) 
values of Z than those shown in Fig. 5. # increases from (a) to (b) to (c) and is 
an eigenvalue in (b), where the wave function and its slope can both be continuous 
at xz = 0. 
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Thus, for a potential energy that approaches a finite constant value 
as zt — +, there may be a finite number of discrete energy levels, or in 
some cases an infinite number [if V(z) falls off slowly enough for large 
|x|], depending on V(x) and the mass of the particle. However, if 
V(z) > +o asxz— +, an argument like that given above shows that 
there will always be an infinite number of discrete energy levels; apart 
from arbitrary multiplying constants there will be just one eigenfunction 
u(x) for each of these. 


CONTINUOUS ENERGY EIGENVALUES 


It is possible to find eigenfunctions that obey the boundary and continuity 
conditions for all energy eigenvalues that exceed the smaller of the two 
numbers V(+«) and V(— ©). If, for example, the potential energy has 
the form illustrated in Fig. 5a, then solutions of the wave equation can be 
found for all positive values of E. This is because the solutions for large 
|x| are of the form 


h2 
and there is no reason why both terms should not be kept. Thus it is 
always possible to adjust the phase of each of the wave functions for large 
|x| (which is equivalent to adjusting the ratios A/B for the solutions for — 
large positive and negative z) so that they join together smoothly when 
continued in to z = 0. 2 
The classical terms periodic (or multiply periodic) and aperiodic are 

sometimes used to designate the particle motions associated with discrete 
and continuous energy eigenvalues, respectively. 


A sin alz| + B cos alz| a=+ (Ar) (8.6) 


DISCRETE AND CONTINUOUS EIGENVALUES IN THREE DIMENSIONS 


We shall assume without further discussion that all the foregoing results 
can be taken over in a natural way for the three-dimensional wave equa- 
tion (8.2). We can expect that, if V(r) - + asr— o inall directions, 
there will be an infinite set of discrete energy levels extending to +. 
If V(r) is bounded as r > © in some direction, there may be a finite or an 
infinite number of discrete levels, depending on the form of V. In this 
case, the discrete energy levels cannot exceed the smallest value that 
V(o) has in any direction. For values of E larger than this smallest 
V(), the energy eigenvalues cover a continuous range extending to +. 


9(JONE-DIMENSIONAL SQUARE WELL POTENTIAL 


As a simple explicit example of the calculation of discrete energy levels 
of a particle in quantum mechanics, we consider the one-dimensional 
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(0) 


Fig. 7 One-dimensional square well potential with (a) perfectly 
rigid walls and (b) finite potential step. 


motion of a particle that is restrained by reflecting walls that terminate a 
region of constant potential energy. Two simple types of potential 
energy are considered. Figure 7a shows a situation in which V(r) = 0 
for —a <x <a, and V(x) = + for |x| > a, corresponding to per- 
fectly rigid, impenetrable walls at the points z = +a. In Fig. 7b the 
increase in potential energy at the walls is abrupt but finite, so that 
V(x) = Vo for |x| > a; because of its appearance, this is often called a 
square well potential. The motion of a classical particle with total energy 
E less than V) is the same for both these potentials; but, as we shall see, 
the quantum-mechanical behavior is different. In general, an abrupt 
finite increase in potential energy at the boundaries of a region forces a 
particle toward the interior of the region. Such a potential may be 
thought of as a limiting case of a potential of the type shown in Fig. 5a, for 
which the force —dV/dz is always directed in toward r = 0. The force 
associated with a square well potential is zero except at the boundaries, 
so that the particle is acted on by no force except a sudden impulse 
directed toward the origin as it passes the points = +a. 


PERFECTLY RIGID WALLS 
It was shown in Sec. 8 that the wave function must vanish at the points 
xz = +a, when the potential energy has the form shown in Fig. 7a. 
From Eq. (8.5) the wave equation for |z| < a is simply 

h? du 


Im dz? 


= Ku (9.1) 


which has the general solution 


4 
u(x) = A sinaz + Beos az a= + Cr) (9.2) 
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Application of the boundary conditions at z = +a gives 


A sin aa + B cos aa = 0 


—A sin aa + B cos aa = 0 
from which we obtain 
A sin aa = 0 B cos aa = 0 


Now we do not want both A and B to be zero, since this would give 
the physically uninteresting solution u = Oeverywhere. Also, we cannot 
make both sin aa and cos aa zero for a given value of aor FE. There are 
then two possible classes of solutions: For the first class 


A=0 and cos aa = 0 
and for the second class 
B=0 and sin aa = 0 


Thus aa = nr/2, where n is an odd integer for the first class and an 
even integer for the second class. The two classes of solutions and their 
energy eigenvalues are then 


NTL 


u(x) = Boos aq n odd 
u(x) = Asin a n even 
2h 2m 2 
E= — in both cases 


It is evident that n = 0 gives the physically uninteresting result u = 0 
and that solutions for negative values of n are not linearly independent 
of those for positive n. The constants A and B can easily be chosen in 
each case so that the eigenfunctions u(x) are normalized. 

There is thus an infinite sequence of discrete energy levels that cor- 
respond to all positive integer values of the quantum number n. There 
is just one eigenfunction for each level, and the number of nodes of the 
nth eigenfunction that are within the potential well is n — 1. These 
results are in agreement with the discussion of Sec. 8. It is interesting to 
note that the order of magnitude of the lowest or ground-state energy 
level is in agreement with the uncertainty relation (3.1). The position 
uncertainty of order a implies a momentum uncertainty at least of order 
h/a, which in turn implies a minimum kinetic energy of order h?/ma?. 


FINITE POTENTIAL STEP 


When the potential energy has the form shown in Fig. 76, it is necessary 
to supplement the general solution (9.2), which is still valid for |z| < a 
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since Eq. (9.1) is unaltered there, by a solution for |z] >a. The wave 
equation in this region is 
_ du 


which has the general solution for E < Vo (bound states) 


—_ 4 

The boundary conditions at z = -+ © discussed in Sec. 8 require that we 

set D = 0 if Eq. (9.3) is to represent the solution for z > a, and C = 0 
if the solution is for z < —a. 

We now impose on the solutions (9.2) and (9.3) the requirements 


that u and du/dz be continuous at x = +a. 


A sin aa + B cos aa = Ce-8 

aA cos aa — aB sin aa = —BCe-f 
—A sin aa + B cos aa = De-* 
aA cos aa + aB sin aa = BDe-Fe 


from which we obtain 


2A sin aa = (C — D)e-# 2aA cos aa = —B(C — D)e-*e 
(9.4) 
2B cos aa = (C + D)e-F 2aB sin aa = B(C + D)e-4e 
(9.5) 


Unless A = 0 and C = D, Kgs. (9.4) have as their consequence 
eae =~ | (9.6) 
Similarly, unless B = 0 and C = —D, Eas. (9.5) give 
a tan aa = 8 | (9.7) 


Now it is impossible for Eqs. (9.6) and (9.7) to be valid at once, since 
on elimination of 6 this would require that tan? aa = —1, which in turn 
would make a imaginary and 6 negative, contrary to Eq. (9.3). _ Also, we 
do not want A, B, C, and D all to vanish. Thus the solutions may again 
be divided into two classes: For the first class 


A=0 C=D and a tan aa = 8 
and for the second class 


B=0 C=-D and - a@cotaa = —B 
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ENERGY LEVELS 


The energy levels are found from a numerical or graphical solution of 
Eqs. (9.6) and (9.7) with the definitions for a and B given in Eqs. (9.2) 
and (9.3). A simple graphical method for effecting this solution is 
described here, since it shows quite clearly the way in which the number 
of discrete levels depends on Vo anda. We put £ = aa, n = Ba, whence 
Eq. (9.7) becomes é tan & = 9, with 

_ 2mV a? 

siege Cie 

Since £ and 7 are restricted to positive values, the energy levels may be 
found in this case from the intersections in the first quadrant of the curve 
of ¢ tan ¢ plotted against ¢, with the circle of known radius (2mV oa?/ nh), 
The construction is drawn in Fig. 8 for three values of Voa?; for each of 
the two smaller of these values there is one solution of Eq. (9.7) and for 
the largest there are two. 

Figure 9 is a similar construction for the solution of Eq. (9.6) in 
which the energy levels are obtained from the intersections of the same 
circles with the curve of —£ cot é in the first quadrant. The smallest 
value of Voa? gives no solution, and the two larger values each give one. 
Thus the three increasing values of Voa? give altogether one, two, and 
three energy levels, respectively. 
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Fig.8 Graphical solution of Eq. (9.7) for three values of Voa?; the vertical dashed 
lines are the first two asymptotes of » = é tan &. 
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Fig. 9 Graphical solution of Eq. (9.6) for three values of Vra?; the vertical 
dashed line is the first asymptote of » = —écot £. 


It is clear from Figs. 8 and 9 that, for a given particle mass, the 
energy levels depend on the parameters of the potential energy through 
the combination Voa?. For Voa? between zero and m*h?/8m, there is just 
one energy level of the first class; for Voa? between 7h?/8m and four times 
this value, there is one energy level of each class, or two altogether. As 
V oa? increases, energy levels appear successively, first of one class and then 
of the other. It is not difficult to see from Eq. (9.2) that, when ordered 
according to increasing eigenvalues, the nth eigenfunction has n — 1 
nodes. 


PARITY 


It follows from the foregoing discussion that the eigenfunctions of the first 
class are even with respect to change in sign of x [u(—x) = u(z)], whereas 
the eigenfunctions of the second class are odd [u(—z) = —u(z)]. This 
division of the eigenfunctions into even and odd types is not accidental 
and will now be shown to be a direct consequence of the fact that the 
potential energy function V(z) is symmetric about x = 0. If we change 
the sign of x in the wave equation (8.5) 

h? d?u(z) 

~ Im dx? 


+ V(a)u(z) = Eu(z) (9.8) 


i 
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and if V(—z) = V(a), we obtain 

h? d?u(—2z) _ = 

mae ee ae + V(z)u(—2) = Eu( a). 

Then w(x) and u(—2) are solutions of the same wave equation with 

the same eigenvalue E. Suppose at first that there is only one linearly 

independent eigenfunction that corresponds to this energy level; then 
u(x) and u(—z) can differ only by a multiplicative constant: 


u(—zx) = eu(z) | (9.9) 


Changing the sign of x in Eq. (9.9) gives u(x) = eu(—z). From these 
two equations it follows at once that 


e = 1 or e= +1 


Thus all such eigenfunctions of a symmetric potential are either even or 
odd with respect to changes of sign of x. Such wave functions are said 
to have even or odd parity. 

If an eigenvalue has more than one linearly independent eigenfunc- 
tion, the foregoing argument breaks down, and these eigenfunctions need 
not have a definite parity; they need not be even or odd. However, we 
can easily see that linear combinations of such eigenfunctions can be 
found such that each has even or odd parity. Suppose that an eigenfunc- 
tion u(x) does not have a definite parity. It can always be written 


u(x) = ue(x) + u(x) 


where u,(z) = 3[u(z) + u(—2z)] is even, u(x) = 3[u(xz) — u(—Zz)] is odd, 
and u, and u, are linearly independent. Then if the wave equation (9.8) 
is symmetric, we can write it as 


h? du, 
~ Om dx? 


ue cl 


+(V — £)u. - 2+ (V — E)u =0 (9.10) 


On changing the sign of z in Eq. (9.10), we obtain 


phe ee 
2m dx? 


Addition and subtraction of Eqs. (9.10) and (9.11) show that u, and u, 
are separately solutions of the wave equation with the same eigenvalue E. 


+ (V — E)u. + 


—(V —£E)u, =0 (9.11) 


A SIMPLIFIED SOLUTION 


Knowledge that the solutions possess a definite parity sometimes simplifies 
the determination of the energy levels, since then we need only find the 
solution for positive x. Even solutions have zero slope and odd solutions 
have zero value at x = 0. If, for example, we wish to find the even solu- 
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tions, Eqs. (9.2) and (9.3) can be replaced at once by 


u(z) = Beos ax 0O<a<a 
u(x) = Ce-b= “>a 


Instead of making both wu and du/dz continuous at x = a, it is enough to 
make the ratio (1/u)(du/dz) continuous at x = a, since the normalizing 
constants B and C are eliminated thereby. . This gives Eq. (9.7) at once. 
Similarly, the odd solutions are 


u(x) = Asin az 0O<z<a 
u(x) = Ce-b z>a 


Then continuity of (1/u)(du/dz) atx = aimmediately gives Eq. (9.6). 


PROBLEMS 


1. Use the arguments of Sec. 6 to set up a differential equation for y that involves 
a second time derivative of y, in the case of a free particle. Discuss any solutions that 
this equation has that are not shared by the free-particle Schrédinger equation. 

2. Show that the free-particle one-dimensional Schrédinger wave equation (6.8) is 
invariant with respect to galilean transformations. Do this by showing that, when 
the transformation 2’ = x — ut, t' =¢ is applied, the transformed wave function 
¥'(z',t’) = f(2,t)¥(z,t) satisfies Eq. (6.8) with respect to the primed variables, where f 
involves only z, t, %, m, and v. Find the form of f, and show that the traveling wave 
solution y¥(z,t) = Ae‘*2-) transforms as expected. 

3. How must a wave packet y fall off for large r in order that the volume integral 
of P and the surface integral of S, in Eq. (7.4) converge? 

4. Show directly that (pz) is real for a wave packet. 

5. Show that for a three-dimensional wave packet 


5 (2!) = = (ope) + (p22) 


6. Calculate the energy levels and plot the eigenfunctions for the three bound states 
in the potential of Fig. 7) when Voa? = 642/m. Compare with the first three states 
for the potential of Fig. 7a. 

7. Discuss the relation between the energy levels for the potential of Fig. 7b and 
those for the potential V(x) = +#,z <0;V(xz) = 0,0 <x <a;V(z) = Voz >a. 
8. Show that if the potential energy V(r) is changed everywhere by a constant, 
the time-independent wave functions are unchanged. What is the effect on the 
energy eigenvalues? 


3 
Eigenfunctions and Eigenvalues 


In Chap. 2, the Schrédinger wave equation was developed and applied 
to a simple problem. Some discussion of the physical interpretation of 
the wave function was given in Sec. 7. This relied for the most part 
on the computation of average or expectation values of operators that 
represent various physical quantities. In addition, however, a position 
probability density was introduced. Thus at the present point we are 
in a position to calculate from the wave function any property that 
depends on the spatial distribution of a particle (for example, the mean- 
square deviation of its position from the average), whereas we can 
calculate only average values of operators other than position. Clearly 
what is needed to round out our interpretation of the wave function is a 
means of computing probability functions for operators in general; this 
would incidentally enable us to obtain the expressions for expectation 
values used in Sec. 7 without separate assumptions. 

In this chapter, we first set up three physical postulates from which a 
complete interpretation of the wave function can be derived, and we then 
apply them to a discussion of the total energy and momentum of a 
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particle and to an illustrative problem. The interpretation presented in 
‘Sec. 7 can then be seen to be a special case of the more general treatment 
given here. | 


10LIINTERPRETATIVE POSTULATES AND ENERGY EIGENFUNCTIONS 


We start with the wave function (r,t), which is a solution of Eq. (6.16) 
and describes the motion of a particle that has mass m and potential 
energy V(r). From this wave function we wish to obtain as complete a 
description of the properties of the motion of the particle as is consistent 
with the uncertainty relations discussed in Sec. 3. 


DYNAMICAL VARIABLES AS OPERATORS 


We first postulate that each dynamical variable that relates to the motion of 
the particle can be represented by a linear operator.1. The operator may be 
simply a multiplication operator such as r for the position, or it may 
be a differential operator such as —72AV for the momentum. 

With each operator can be associated a linear eigenvalue equation, 
defined near the beginning of Sec. 8. Thus with the operator 2 may be 
associated the equation 


Qu, = wd, (10.1) 


where v, is the eigenfunction of 2 corresponding to the eigenvalue w,. 

Our second postulate is that one or another of the eigenvalues w, is the 
only possible result of a precise measurement of the dynamical variable repre- 
sented by 2. This implies that the eigenvalues of all operators that 
represent physically measurable quantities are real numbers. 


EXPANSION IN EIGENFUNCTIONS 


We assume that all the eigenfunctions of any dynamical variable con- 
stitute a complete set of functions in the sense that an arbitrary contin- 
uous function can be expanded in terms of them. This is a mathematical, 
not a physical, assumption and will be discussed further below in con- 
nection with energy and momentum eigenfunctions.’ 

Suppose now that a particular wave function y is expanded in terms 
of the eigenfunctions v, of the operator 2. We adopt the statistical 
interpretation of y given at the beginning of Sec. 7, according to which 
there are a large number of identical independent regions of space, or 


1 An operator © is linear if it satisfies the equation Q(aivi + aev2) = aiQy1 + a2Qy2 
for arbitrary complex numbers a; and functions yy. 

2 For a discussion of completeness, see, for example, R. Courant and D. Hilbert, 
“Methods of Mathematical Physics,”’ vol. I, p. 369 (Interscience, New York, 1953). 
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intervals of time, in each of which is a particle described by y. We then 
make measurements of the dynamical variable represented by 2 on each 
of these particles. Our third physical postulate is that the number of 
measurements that result in the eigenvalue w, 1s proportional to the square of 
the magnitude of the coefficient of v, in the expansion of y. This postulate, 
due to M. Born (see page 25), enables us to associate a probability func- 
tion with any dynamical variable.!' A corollary is that we are certain 
to measure a particular eigenvalue w, only when the wave function that 
describes the particle is the corresponding eigenfunction »,. 

Rather than develop the consequences of these postulates for an 


arbitrary operator ©, we consider here the total energy of the particle, and 


in Sec. 11 the momentum. Most of the results obtained are readily 
applicable to other physical operators. 


THE TOTAL-ENERGY OPERATOR 


According to the uncertainty relation (3.3), a precise measurement of the 
total energy of a particle cannot be made in a finite length of time. Thus 
if the total energy is to have a definite value, it is essential that the 
potential energy V(r) be independent of the time. Then the operator 
— (h?/2m)V? + V(r), which was shown in Sec. 8 to be equivalent to the 
total-energy operator 74(@/dt), has eigenfunctions u(r) that need not 
involve the time. The energy-eigenvalue equation is Eq. (8.2), 


2 
| - oe Vv? + ve | ur(r) = Euz(r) (10.2) 
where the eigenfunction wz(r) corresponds to the eigenvalue E and obeys 
the boundary and continuity conditions of Sec. 8. 

As discussed in Sec. 8, the energy eigenfunctions can be divided into 
two classes: those which are well localized and are associated with discrete 
eigenvalues and those which remain finite at great distances and possess 
a continuous range of eigenvalues. 


NORMALIZATION IN A BOX 


It is often desirable to be able to treat these two classes on the same basis : 
this can be accomplished by enclosing the particle under investigation 
in a box of arbitrarily large but finite volume. The simplest physical 
situation to which this approach is applicable is one in which the walls of 
the box are perfectly rigid, so that, as shown in Sec. 8, the wave function 
vanishes there. Then the discussion of Sec. 8 shows that all the eigen- 


1 An alternative deterministic interpretation in terms of “hidden variables” rather 
than the dynamical variables, which has not been widely accepted, has been proposed 
by D. Bohm, Phys. Rev. 85, 166, 180 (1952). See also J. S. Bell, Rev. Mod. Phys. 38, 
447 (1966); D. Bohm and J. Bub, Rev. Mod. Phys. 38, 453, 470 (1966). 


48 QUANTUM MECHANICS 


values are discrete. If the box is large in comparison with the dimensions 
of physical interest in the problem, the eigenvalues that were discrete in 
the absence of the box are practically unaffected, since before the walls 
were introduced the wave functions were extremely small there. Also, 
the eigenvalues that were continuously distributed in the absence of the 
box are very closely spaced; this is shown explicitly for a free particle in 
Sec. 11. 

It is more convenient to assume that the wave functions obey pert- 
odic boundary conditions at the walls of the box than that they vanish 
there, since it is then possible to get a simpler description of the momen- 
tum eigenfunctions (see Sec. 11). We choose the finite region to be a cube 
of edge length L centered at the origin and require each wave function 
to have the same value at corresponding points of opposite faces of the 
cube, and the same derivative normal to the wall. These boundary 
conditions make the otherwise continuous eigenvalues discrete, since 
the phase of the eigenfunction at great distances is no longer arbitrary 
[see the discussion of Eq. (8.6)]. As with the rigid-walled box, the pres- 
ence of the walls has a negligible effect apart from imparting discreteness 
to the otherwise continuous eigenvalues and providing a finite volume in 
which these wave functions can be normalized; we shall continue to use 
the word “continuous” to describe these functions, even when box 
normalization is used. 


ORTHONORMALITY OF ENERGY EIGENFUNCTIONS 


The integral {|uwz(r)|? d’r, which converges in any case for the discrete set 
of eigenfunctions, converges for all eigenfunctions when they are nor- 
malized in the box of finite volume L?. The coefficient of ug can then be 
chosen so that this integral is equal to unity, and the wz(r) are normalized. 

We now show that the eigenfunctions corresponding to two different 
eigenvalues E and E’ are orthogonal, that is, that the integral of the 
product of one of them and the complex conjugate of the other over the 
common domain of the functions is zero. From Eq. (10.2) we have that 
u*.(r) satisfies the equation (where V is again assumed to be real) 


| - om Micke v@| us (t) = E’ug(e) (10.3) 


In accordance with our physical interpretation, we have assumed that E’ 
is real; this is verified below. We multiply Eq. (10.2) by uz and Eq. 
(10.3) by uz, integrate over the volume L’, and take the difference between 
the two resulting equations. The V terms cancel and leave 


2 
= =) (uz .V2un — ugV?ux,) dr = (E — E’) if utug dr (10.4) 
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The integral on the left side of Eq. (10.4) can be transformed by 
Green’s theorem into a surface integral over the surface A of the cube: 


/ (upV2uz — ugV2uz) Gr = / Vi: (upVug — uzVuz) dr 
= I, (uyVur ea UnVUr)n dA (10.5) 


where the subscript 2 designates the component of the vector in the 
direction of the outward normal to the element of area dA. Since the 
imposition of periodic boundary conditions gives each wave function and 
its normal derivative the same values at corresponding points of opposite 
faces of the cube, the outward normal derivative has opposite signs on 
opposite faces, and the surface integral in (10.5) vanishes. Then Eq. 
(10.4) tells us that, if EH ~ E’, ug and wg are orthogonal.! 

An energy eigenvalue E is said to be degenerate when two or more 
linearly independent eigenfunctions wi, wz, . .. correspond to it. 
Orthogonal linear combinations of degenerate eigerfunctions can be 
found in many different ways. For example, Ua = ait + deve can be 
made orthogonal to u; by choosing the constant coefficients a, and ae 
such that 


a1 _ _ Surus d*r 
Qe Seca]? ar 


This choice does not interfere with the normalization of u,, and u, is still 
an energy eigenfunction with the eigenvalue EL. Evidently the choice 
of orthogonal linear combinations is not unique. By an extension of 
this procedure, all the energy eigenfunctions can be made orthogonal to 
each other even though some of the eigenvalues are degenerate. — 

Such a set of eigenfunctions, each of which is normalized and 
orthogonal to each of the others, is called an orthonormal set of functions. 
We specify an orthonormal set of nondegenerate energy eigenfunctions 
by the relation 


fu (@)us(t) dr = dae (10.6) 


where dgz is the symmetric Kronecker 6 symbol that equals unity if 
E = E*‘ and is zero otherwise. If there is degeneracy, Eq. (10.6) must 
be replaced by 


fue, (t)ue.(r) dr = ozz See’ (10.7) 


1It is apparent that this proof of orthogonality can be applied to a discrete set of 
eigenfunctions even though the box is not introduced, since the u’s vanish rapidly 
at great distances and the surface integral, which is then over a sphere of infinite 
radius, is zero. The continuous set of energy eigenfunctions can also be treated 
without using the box normalization (as is the continuous set of momentum eigen- 
functions in Sec. 11). 
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where the index s distinguishes between orthogonal degenerate eigen- 
functions. It is often convenient to omit explicit mention of s and use 
Eq. (10.6) for degenerate situations as well, in which case the index s 
is implied. 


REALITY OF THE ENERGY EIGENVALUES 


We can now see directly that EZ is a real number, as has been assumed. 
We multiply Eq. (10.2) by wz(r) and integrate over the box of volume L’. 
If ux is normalized, the result is 
h? 

E=-— aA if utV°ug a'r + / V(r) |ue|? dr 
which may be expressed in terms of expectation values as (1/2m)(p?) + (V). 

The second term (V) is real since its integrand is real. The first 
term can be shown explicitly to be real by means of a partial integration: 


_ J U;V2Ur Gr = f (Vuz) « (Vugz) dr — di un(Vuz)n dA 


The volume integral is evidently real, and the surface integral [like that 
in Eq. (10.5)] vanishes because of the periodic boundary conditions at the 
walls of the box. It is interesting to note that (p”) cannot be negative. 


EXPANSION IN. ENERGY EIGENFUNCTIONS 


As mentioned near the beginning of this section, we make the mathe- 
matical assumption that all the eigenfunctions uwz(r) of the total-energy 
operator constitute a complete set of functions in the sense that an arbi- 
trary continuous function can be expanded in terms of them. Then, if 
we have any wave function y(r) at a particular instant of time that is 
normalized in the box L? and obeys periodic boundary conditions at the 
walls, the assumed existence of the expansion 


v(t) = ) Agus(r) (10.8) 
E 


makes it possible to find unique coefficients Az that do not depend onr. — 
The coefficients in the expansion (10.8) can be determined by 
multiplying both sides by wu} and integrating over the box. We assume 


that the order of ) and J d’r can be reversed! and obtain 


1 The propriety of changing the order of summations and integrations must in prin- 
ciple be investigated separately in each case. The mathematical considerations 
entailed are beyond the scope of this book, and we shall always assume that such inter- 
changes are permissible in situations of physical interest. 
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if up(r)y(r) dr = ) Az / uy (r)ug(r) dr = » Ag dee = Ap 
EB E 
(10.9) 


with the help of Eq. (10.6) or (10.7). 


THE CLOSURE PROPERTY . 
Substitution of the expression (10.9) for Az back into Eq. (10.8) gives 


vie) =) | f uke ve) ar’ | wale) 
E 
which we rearrange to give 


vie) = f ve) | > wbe)ue@) | ar (10.10) 
E 


Since ¥(r) is an arbitrary continuous function of r, Eq. (10.10) implies 
that the bracketed part of the integrand vanishes unless r’ = r, since 
otherwise the value of y at the point r given by Eq. (10.10) would change 
when the values of y at other points r’ ~ r are changed, and this is con- 
trary to the assumption that y can have an arbitrary form. Moreover, 
the integral of the term in brackets must be unity when the volume of 
integration includes the point r’ = r. We conclude therefore that 


Y ub’ yus(e) = 0 r’ ~r 
a (10.11) 
ie un(r’)uz(r) d3r’ = 1 

E 


if the volume of integration includes the point r’ = r. 

Equations (10.11) describe the closure property of the orthonormal 
set of functions uz(r) and are seen to follow directly from their complete- 
ness as expressed by Eq. (10.8), whether or not they happen to be energy 
eigenfunctions. \ 


PROBABILITY FUNCTION AND EXPECTATION VALUE 


The second and third physical postulates presented at the beginning of 
this section state that the energy eigenvalues are the only possible results 
of precise measurement of the total energy and that the probability of 
finding a particular value E when the particle is described by the wave 
function ¥(r) is proportional to |Az|?. It is easily seen that the pro- 
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portionality factor is unity, because if we put for the energy probability 
function 


P(E) = |Ag\? (10.12) 


we see that P(E) sums to unity: 


) PE) ~ > f un(r)y(r) dér f un(s’)y*(r’) dr’ 
EB 3 | 
= ff y*(r’)y(r) [> ub(rus(r’) | dr dr’ 
E 
= f waar =1 


since y is normalized; use has been made here of Eqs. (10.11). 


We can also compute the average or expectation value of the energy 
from the probability function: 


(E, = ) EP(E) = ) f Bus@)ye) ar f usey*e') @r’ (10.13) 
E E ; 


If we substitute for Eu}(r) from Eq. (10.3), the first integral in Eq. (10.13) 
can be integrated twice by parts as follows: 


J Eut(r)y(s) dr = if vr) |- x V? + ve| u(r) dir 


h? 
= * a5 8 3 
[ uz (fr) am @| ¥(r) d r 
The two surface integrals that result from the partial integrations vanish 


because of the periodic boundary conditions on wg and y. Thus, with 
the help of Eqs. (10.11), Eq. (10.13) becomes 


= fuae@|— ge + VO |HO ar f water) ay 
E 
= ff y*(r’) {| - = v2 +4 ve) | v«)| [> ub(ryus(e’) | dr d3r’ 
E 


-fvo|-Ev+ve lve (10.14) 


The result embodied in Eq. (10.14) confirms the supposition made 
in Sec. 7 that the expectation value of an operator is to be calculated by 
inserting the operator between y*(r) and y(r), so that it operates just on 
the latter, and then integrating over r. 
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GENERAL SOLUTION OF THE SCHRODINGER EQUATION 


If the potential energy V(r) is independent of ¢ and we know the solution 
of the Schrédinger equation (6.16) at a particular time, we can write a 
formal expression for the solution at any time. We expand y(r,é) in 
energy eigenfunctions at the time ¢, in which case the expansion coeffi- 
cients depend on the time. 


ve) =D An@uae) — Aa) = fue dr (00.15) 
Substitution of Eq. (10.15) into the wave equation (6.16) gives 


ih use) 5 As) =) As()Eustt) (10.16) 
E E 


Because of the orthonormality of the uz, Eq. (10.16) is equivalent to 
in © Ag() = EAs(t) 


which may be integrated at once to give | 
Ag(t) = Axg(to)e7*# (t-te) /A (10.17) 


Note that P(E) = |Ax(é)|? = |Az(to)|? is constant in time. 
Thus, if y(r,¢) is known at the time ¢ = to, the solution at any time f 
is given in terms of Eqs. (10.15) and (10.17): ‘ 


VE) = J Al to)eF-19 ug (r) 
E 


Ax(to) = fuz(t’)W(t’ to) dr’ 


(10.18) 


or 


vet) = f [> WE (uw (tye -19 9 | (sto) dtr’ (10.19) 
i E 


The solution (10.18) is a linear combination of the separated solutions 
(8.3) obtained earlier. 


11OMOMENTUM EIGENFUNCTIONS . 


The eigenfunctions of the linear momentum operator —7hV provide a 
second instructive example of the application of the general ideas devel- 
oped at the beginning of the preceding section. They also are of con- 
siderable usefulness in solving problems of physical interest. 


FORM OF THE EIGENFUNCTIONS 


The momentum eigenfunctions are solutions of the three eigenvalue 
equations | 


—thVu,(r) = pu,(r) ee ee | (11.1) 
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or 


a) .. O 
— th = Up(t) = petip(t) —th 5 Uolt) = Pitie(®) 


— ih 2 ug(t) = patig(t) 


They have the form 


uper 
up(r) = C exp pe) ) 
where C is a normalization constant. 
It is convenient, as in Sec. 6, to change from the momentum vector 


p to the propagation vector k = p/h and to rewrite the momentum 
eigenfunctions 


u(r) = C exp (ik: r) (11.2) 


These are eigenfunctions of the momentum operator with the eigen- 
values hk. 


BOX NORMALIZATION 


As with the energy eigenfunctions discussed in Sec. 10, we can restrict 
the domain of the u,(r) to an arbitrarily large but finite cubical box of 
volume L? centered at the origin, at the walls of which the functions obey 
periodic boundary conditions. Then u, is normalized if C = L-#. Also, 
k is no longer an arbitrary real vector; its components are restricted to 
the values 


2rn, 
L 


where nz, n,, and n, are positive or negative integers or zero. The spacing 
of neighboring k vectors and of their energy eigenvalues fi?k?/2m can be 
made as small as desired by making LF sufficiently large. It is always 
assumed that the limit L — © is taken at the end of the calculation. 

It is interesting to note that the momentum eigenfunctions (11.2) 
cannot exist within a box that has perfectly rigid walls, since these eigen- 
functions do not vanish anywhere. This is analogous to the classical 
situation in which the momentum of a particle that is reflected from a 
rigid wall is not conserved. On the other hand, the cubical box with 
periodic boundary conditions is equivalent to a situation in which the 
entire infinite space is divided into adjacent cubes and all wave func- 
tions are periodic throughout space with the period L along each of the 
three cartesian axes. If the periodicity of the space is carried over to 
the analogous classical situation, a particle passing through a wall would 


i eae k= 


k, = 


(11.3) 


Se 
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be equivalent to one that strikes that wall and appears at the correspond- 
ing point of the opposite wall with its momentum vector unchanged. 
The orthonormality of the momentum eigenfunctions 


u(r) = L- exp (tk-r) (11.4) 


is readily established. For integration over the volume L’ 
if ut (r)ux(r) dr 


1 see. LC, a 

pagel t(k,—l,) x i(ky—ly)y i(k,—1,)z 

Zi ie dz [ —° dy Pie dz 
= x,t, 5x,l, 6.1, => Ou (11.5) 

where the 6’s are Kronecker 6 symbols and use is made of Eqs. (11.3). 

Orthogonality could also have been shown by the more general method 

used in Sec. 10 for the energy eigenfunctions [see Eq. (10.4)]. 


THE DIRAC 6 FUNCTION 


It was stated in Sec. 10 that continuous sets of eigenfunctions can be 
handled without introducing the box with periodic boundary conditions 
(which has the effect of making the set discrete with an arbitrarily small 
spacing of eigenvalues). This can be shown explicitly for the momentum 
eigenfunctions with the help of the Dirac 6 function,! which can be 
defined by the relations 


d(x) = 0 if z#0 f &(z) dx = 1 (11.6) 


where the region of integration includes the point sr = 0. An equivalent 
definition is that, for an arbitrary function f(z) that is continuous at 
x = 0, the equation 


Jf(z) 8(x) dx = f() (11.7) 


is valid, where again the integration includes the point z = 0. 

It is apparent from a comparison of Eqs. (11.6) and (10.11), or of 
Eqs. (11.7) and (10.10), that the bracketed quantity in Eq. (10.10) can 
be expressed in terms of 5 functions: 


d uE@’)us(t) = (x — 2’) dy — y’) 6@ — 2’) = Hr — 1’) (11.8) 
E 


Comparison of Eqs. (11.8).and (10.6) shows that the closure property is a 
kind of orthonormality of the eigenfunctions with respect to summation 
over the eigenvalues. 


1P. A. M. Dirac, ‘‘The Principles of Quantum Mechanies,”’ 4th ed., sec. 15 (Oxford, 
New York, 1958). = 
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A REPRESENTATION OF THE 5 FUNCTION 


The definition (11.6) or (11.7) shows that 6(x) is an asin singular 
function.!. It may be thought of qualitatively as being zero everywhere 
except at x = 0 and being so large there that the area between it and the 
z axis is finite and equal to unity. More formally, it can be represented in 
a number of different ways as the limit of a sequence of analytic functions. 

A particular representation that is quite useful involves (sin gx) /mz 
as a function of z, where g is a positive real number. This has the value 
g/m at x = 0, oscillates with decreasing amplitude and with period 21/g 
as |x| increases, and has unit integral from z = — © tox = + indepen- 
dently of the value of g. Thus the limit of this function as g > © has all 
the properties of the 6 function: It becomes infinitely large at x = 0, 
it has unit integral, and the infinitely rapid oscillations as |z| increases 
mean that the entire contribution to an integral containing this function 
comes from the infinitesimal neighborhood of z = 0. We can therefore 
put 


5(x) = lim ae , (11.9) 


g-> us 


NORMALIZATION IN TERMS OF THE 6 FUNCTION 


The representation (11.9) of the 6 function can be used to set up an 
orthonormality integral like that given in Eq. (11.5), where now we do 
not impose the box normalization but allow the momentum eigenfunc- 
tions to have the form (11.2) over all space with all real vectors k. The . 
integral fut (r)u,(r) dr is the product of three integrals, each of which 
can be expressed in terms of a 6 function: 


[2 tts de = Tim [ett de 
—« 


g7 0 

_ y 2sin g(k, — 1) 

= ae kz — le 

= 2rb(kz — lz) . (11.10) 


Thus the momentum eigenfunctions in infinite space can be written 
ux(r) = (8x*)-? exp (ik - r) (11.11) 


1 A rigorous mathematical basis has been provided for the 6 function by L. Schwartz. 
It may be thought of as a linear functional that operates on a set of sufficiently well- 
behaved test functions; thus the left side of Eq. (11.7) could be written 8[f]. For 
detailed discussions, see M. J. Lighthill, ‘Introduction to Fourier Analysis and Gener- 
alized Functions” (Cambridge, New York, 1958); A. Messiah, ‘Quantum Mechanics,’’ 
vol. I, App. A (Interscience, New York, 1961). 


EIGENFUNCTIONS AND EIGENVALUES 57 


in which case the orthonormality relation becomes 


Juk(r)uy(r) dr = 5(ke — Ie) 8(ky — ly) 6(ke — 4) = 8°(k — 1) 
(11.12) 


It will be shown in Sec. 12 that the box and 6-function normalizations 
of the momentum eigenfunctions give the same final result in a typical 
problem. 


SOME PROPERTIES OF THE 6 FUNCTION 


It is important to note that, because of its singular character, the 6 
function cannot be the end result of a calculation and has meaning only 
so long as a subsequent integration over its argument is carried out. 
With this understanding we can write some relations between 6 functions.} 


d(x) = 6(—2) 

i(z) = —8'(—2) 

xz 6(z) = 0 

x 6'(x) = —6(z) (11.13) 
é(ax) = a— 6(z) a>0o 

5(z* — a*) = (2a)-[5(@ — a) + 8(@ + a)] a>0o 


— f &a — x) 6(@ — b) dx = S(a — B) 
f(x) 6(a — a) = f(a) 6(@ — a) 


Here, a prime denotes differentiation with respect to the argument. 

_ Each of the first six of these equations can be established by multi- 
plying both sides by a continuous, differentiable function f(x) and inte- 
grating over z. For example, the fourth of Eqs. (11.13) gives | 


[ Hee 8) de = — f 62) 5 laf @)) dx 
— f s@)[f@) + 2f'@) de = — f fe) (2) de 


where the boundary terms that result from the partial integration vanish. 
Thus x8’(x) has the same effect when it is a factor in an integrand as has 
—8(x). Similarly the seventh of Eqs. (11.13) means that the two sides 
give the same result when multiplied by f(a) or f(b) and integrated over a 
or b. The last equation is verified by integrating both sides over either 
x or a. 


I 


CLOSURE 


The closure property of the momentum eigenfunctions, with both box 
and 6-function normalization, can be established without the help of the 


1 Dirac, op. cit., p. 60. 
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completeness assumption that was made in Sec. 10 for the energy eigen- 
functions. With box normalization, the expression analogous to the left 
side of Eq. (11.8) is 

“ ; a -) ry 00 * : 
ux(r’)u,(r) = L-3 e2ntin, (2-2!) +n, (yy!) +n, (2-2) /L 
y ye 


Ng =~ O Ny=—O n= —w 
This is readily evaluated in the limit of large L, in which case the sum- 
mand changes by a negligible amount as each n changes by one unit. 


We can then regard n, as a continuous variable and replace y by 


nz=— 0 


fe. dnz = (L/27) i dk,. We thus obtain 
y Ux (t’)ux(t) => 
k 


(833)-1 f a! is o eilkae—z2) +h y-v +k (e-2)1 di, dh, dhg 
= 6(z — 2’) by — y’) be — 2’) = Hr — vr’) (11.14) 
on making use of Eq. (11.10). 


A similar calculation can be carried through, using the 6-function 
normalization, in which case we obtain from Eqs. (11.11) and (11.10) 


Juz (e)ug(r) de = fffug(r’)uy(r) dk: dk, dk, = 6%r —r’) (11.15) 


The closure relation (11.14) or (11.15) shows that the momentum. 
eigenfunctions are orthonormal with respect to summation or integration 
over the eigenvalue k as well as with respect to integration over the 
position vector r. 


EXPANSION IN MOMENTUM EIGENFUNCTIONS 


An arbitrary continuous function y¥(r) can be written in terms of the 6 
function as 


¥(r) = fe’) 68 — 1’) d*r’ (11.16) 


If we substitute the left side of Eq. (11.14) in place of 6%(r — r’) in Eq. 
(11.16), we obtain 


vee) = f ve") > ub @)ucte) ar! = Y Agun(e) 
: . (11.17) 
Ay = J uc (tp (r’) dir’ 
Similarly, if we substitute for 69(r — r’) from Eq. (11.15), we obtain 
W(x) = fe) Suz )uc(r) dk dr’ = fAyuy(r) dk (11.18) 
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with the same expression for Ay. Equations (11.17) and (11.18) show 
that it is possible to expand an arbitrary function nm momentum eigen- 
functions that are normalized either in a box or by means of 6 functions.! 


PROBABILITY FUNCTION AND EXPECTATION VALUE 

The momentum probability function associated with a normalized wave 
function ¥(r) is proportional to |A,|?.. The proportionality factor is 
unity, since if we put 


P(k) = |A,/? (11.19) 

it is easily shown in analogy with the summation of Eq. (10.12) that 
Pe) =1 and f Pde ak =1 (11.20) 
k 


for the box and 6-function normalization, respectively. 
The expectation value of the momentum when box normalization 
is used is . 


(p) =A) kP(e) =a) f tut ye) dtr f ug(e Wr’) dr” (11.21) 
k k 


From the complex conjugate of Eq. (11.2), we can replace kug(r) by 
iVug(r). Then the first integral in Eq. (11.21) can be integrated by 
parts, and the surface integral vanishes because of the periodic boundary 
conditions on y and ug. Thus, with the help of Eq. (11.14), Eq. (11.21) 
becomes 


(P) 


~iny i ux (1) Vy(r) dr / uy (r’)W*(r’) dr’ 
k. 


—ih / / V*(r’)[Vy(r)] 8%(r — 2’) d2r dr’ 
= —an [ y*@)V¥@) a (11.22) 


This is in agreement with the second of Eqs. (7.8). 

When 6-function normalization is used, the details of the calculation 
are very similar to those given above, except that the surface integral 
that results from the partial integration is over a sphere of infinite radius; 
it is zero because y becomes vanishingly small at great distances. This is 
consistent with the supposition that y is normalized; otherwise neither 
JP(k) d*k nor (p) have any physical meaning. The result of the calcula- 
tion in this case is the same as Eqs. (11.22) and (7.8). 


1 These results, although not rigorously established here, are equivalent to the mathe- 
matical theorems on the expansibility of functions in Fourier series and Fourier 
integrals. . 
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12LJMOTION OF A FREE WAVE PACKET IN ONE DIMENSION 


The motion of a free particle (no external forces) in one dimension is 
described by the Schrédinger wave equation (6.8). The study of this 
motion provides an interesting application of the expansion. techniques 
developed in Secs. 10 and 11. As a first step, we find the minimum value 
of the uncertainty product given in Eq. (3.1) and the possible forms of 
the one-dimensional wave packet that correspond to it, all at a definite 
instant of time. The structure of this minimum packet is the same 
whether or not the particle is free, since this form can be regarded simply 
as an initial condition on the solution of the Schrodinger equation for 
any V. However, the analytical work involved in finding y at, other 
times is especially simple in the force-free case. 


THE MINIMUM UNCERTAINTY PRODUCT: _ 


In order to find the minimum value for the uncertainty product Az - Ap, 
we must first define what we mean by Az and Ap. Although many 
expressions are possible, the simplest to handle analytically is the root- 
mean-square deviation from the mean, where the word “mean” implies 
the expectation value of Sec. 7.? 


(Ax)? = ((@@ — (&))?) = (x?) — (22(2)). + (()?) = (2?) — (a)? 
(Ap)? = ((p — (p))?) = (p?) — {p)? 


Here the equalities follow directly from the general definition of expecta- 
tion value given in Sec. 7. If now we put 


(12.1) 


ase-@) bep-@=--w(Z-4)) arg 
then 
(Ax)? (Ap)? = J" vray de [yey ae 
= {2 tv) de ("GG dz (12.3) 


The transformation of the a integral in Eq. (12.3) is obvious; the similar 
transformation of the 8 integral follows from a partial integration when 
we remember that y is a normalized wave packet, which vanishes at 
r= to, 


1W. Heisenberg, ‘“‘The Physical Principles of the Quantum Theory,” pp. 17-19 
(University of Chicago Press, Chicago, 1930). 

2 Note that Az and Ap are each calculated separately from y. We do not consider 
the possible existence of a ‘joint probability distribution” for z and p; see P. Suppes, 
Phil. Sci. 28, 378 (1961). 
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The inequality 


_ , Jhg* da |? : 
J \f—9Figrae| & 2° 
where all integrals are from x = — © to +, is obviously true, and the 


equality is applicable only if f = yg, where y is a constant. ‘From this 
inequality we obtain at once 


Sl fl? dx f\gl? dx > |ff*g dal? 
If now we replace f by ay and g by By, Eq. (12.3) becomes 
(Ax)? (Ap)? > |f(a*w*) (6p) dx|? = |fy*aBy dz|? (12.4) 
The last term in Eq. (12.4) can be written . 
[S¥*[E (a8 — Ba) + $(aB + Ba)]y de? 
= 4£|J¥*(a8 — Ba)y dz|? + Z/f¥*(eB + Ba)pdz|? (12.5) 


The cross term in the product that is omitted on the right side of Eq. 
(12.5) can be seen to vanish when use is made of the relation 


(fy*apy dx)* = fya*p*y* dx = [(B*Y*) (ap) dx = Jy*Bay dx 


which is obtained by using partial integration and remembering that a 
is real. Now from Eq. (12.2) 


(aB — Ba)y = —th Eee hae = (ot) | = thy | (12.6) 


We thus obtain from Eqs. (12.4) to (12.6) 
(Aa)? (Ap)? > th? or Ax: Ap > gh (12.7) 


where the equality can hold only if the second term on the right side of 
Eq. (12.5) is zero. This is the precise expression of the Heisenberg 
uncertainty relation (3.1), when the uncertainties Az and Ap are defined 
as in Eq. (12.1). 


FORM OF THE MINIMUM PACKET 


It follows from the foregoing derivation that the minimum uncertainty 
product is attained only wheh two conditions are fulfilled: 


ay = by . (12.8) 
S¥*(aB + Ba)ydx =O | (12.9) 
Equations (12.8) and (12.2) give us a differential equation for y 


t-(56¢-o+ Ply 
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which is readily integrated to give 
_ toe ays 4 Unie 
y(z) = N exp E (x — (x))? + 7, (12.10) 


where N is an arbitrary constant. 
' Equation (12.9), with the help of Eq. (12.8), becomes 


ie fi 
€ + Je) [ vat de =0 


which evidently requires that y be pure imaginary. Then, since we want 
Eq. (12.10) to represent a wave packet for which the integral of ||? con- 
verges, y must be negative imaginary. The magnitude of the constant 
N can now be fixed by normalizing y. 


flv)? dz = 1 
Similarly, y can be determined by requiring that 
S@ — ())*|y|? dx = (Az)? 


The integrals are readily evaluated and lead to the normalized minimum 
wave packet 


v(x) = [2n(Ax)?}+t exp | - Fae + ad (12.11) 


MOMENTUM EXPANSION COEFFICIENTS 


The one-dimensional momentum eigenfunctions analogous to Eqs. (11.4) 
and (11.11) are 


ux(xz) = Lo-tetks (12.12) 
for normalization in a one-dimensional “box” of length L, and 
Ux(x) = (2r)-be*e (12.13) 


for 6-function normalization. Since for a free particle the wave equation 
has the simple form of Eq. (6.8) 


tha—= —— 2 (12.14) 


the momentum eigenfunctions are also eigenfunctions of the energy.} 
Thus any solution of the wave equation can be written in a form analogous 
to Eq. (10.18), 


W(2,t) = O or / dk) Ape*Butlxg, (22) (12.15) 
k 


1 The converse is not necessarily true, since there are two solutions of Eq. (12.16) 
for k (positive and negative) for each value of Ex. 
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where the A, are independent of x and ¢; the entire time dependence is 

contained in the exponential factor. Equation (12.15) is readily verified 

to be a solution of Eq. (12.14) by direct substitution, provided that 
h?k? 


a (12.16) 


k 

The problem of finding the motion of a wave packet is thus resolved 

into finding the expansion coefficient A, at some particular time, say 

= 0, and then using Eqs. (12.15) and (12.16) to find y(z,t) at other 

times. At ¢ = 0, the exponential factors in Eq. (12.15) are unity, and 

we may use the one-dimensional analog of the second of Eqs. (11.17) to 
find Ay. 


Ax = Sug (z)¥(z,0) dx (12.17) 


The limits on the integral are x = +#L or x = +, according as box 
or 6-function normalization is used. The momentum probability func- 
tion P(k) = |A,e-*#«/*/2 = |A,|? is independent of the time, so that (p) 
and Ap, for example, are constants. 


CHANGE WITH TIME OF A MINIMUM PACKET 


As a simple specific example, we take ¥(x,0) to have the form of Eq. 
(12.11) with (2) = (p) = 0, so that the wave packet initially is centered 
at « = 0 and has zero average momentum. Then, using box normaliza- 
tion, Eq. (12.17) gives 


L x : 
= 2 2]-3 Sek eS ee 
Ay = [2rL?(Azx)?] ie exp | Aan)? ike | dx 
21% 
= beeex en kU Az)? (12.18) 
where L is assumed to be so large that the contribution to the integral 


from |z| > 3L can be neglected. Substitution into Eq. (12.15) gives the 
wave function for general values of ¢: 


W(x,t) = Ape 2m, (x) (12.19) 
k 


where k = 2rn/L and n takes on all positive and negative integer values 
and zero. As in Sec. 11, LZ may be taken arbitrarily large, n can be 
regarded as a continuous variable, and the summation replaced by fdn, 
which in turn is the same as (L/2r)fdk. Thus 


Az)? |? pe ak? : 
¥(z,t) = Ee f. exp | a a _ _ + ie dk 


= (27)? ( Ax + ma exp — pid we (12.20) 
2m Ax 4(Ax)? + 2iht/m 
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The position probability density is then 


W(2,0)/* = {2 | (a? + map|} 


Ekg 
2[ (Ax)? + A7?/4m?(Az)* 


Equation (12.21) is of the same form as |¥(z,0)|?, except that (Ar)? 
is replaced by (Az)? + h?t?/4m?(Az)?, which is equal to (Ax)? + (Ap)?é?2/m?. 
Thus the center of the packet remains at x = 0 while the breadth of the 
packet increases as ¢ departs from zero in both past and future directions. 
The smaller the initial uncertainty in position, the larger the uncertainty 
in momentum and the more rapidly the packet spreads; the time-depend- 
ent part of the above expression, ¢(Ap)/m, is simply the distance traveled 
by a classical particle of momentum Ap in the time ¢. 

Use of the 6-function normalization does not alter the results of the 
foregoing calculation. The expression for A; given in Eq. (12.18) is to 
be multiplied by (L/27)!; in Eq. (12.19) the summation is to be replaced 
directly by fdk, thus eliminating a factor L/2z; finally, uw, in Eq. (12.19) 
is to be multiplied by (L/27)!. These three factors cancel, and so Eqs. 
(12.20) and (12.21) are unaffected by the choice of normalization of the 
momentum eigenfunctions. 


exp — (12.21) 


CLASSICAL LIMIT 


We have seen in Sec. 7 that a wave packet always moves like a classical 
particle insofar as the expectation values of its position and momentum 
are concerned. However, classical dynamics is useful as a description of 
the motion only if the spreading of the wave packet can be neglected over 
times of interest in the particular problem. 

As a simple example of the kind of parameter that indicates when the 
classical limit is realized, we consider a wave packet that corresponds to a 
classical particle moving in a circular orbit of radius a and period T. We 
shall assume that this packet is sufficiently well localized so that the 
potential energy does not vary appreciably over its dimensions. Then 
the classical theory can provide a useful description of the motion only 
if a wave packet like that discussed above spreads by an amount that is 
small in comparison with a during a time that is large in comparison 
with T. The smallest spread of a packet during a time interval of mag- 
nitude ¢ is attained when Az is chosen to be of order (At/m)!. We thus 
require that (At/m)?<«<a when t>>T. This condition may be simply 
expressed by saying that the angular momentum 21ma?/T of the particle 
must be very large in comparison with. Thus for most atomic systems, 
where the angular momentum of each electron is of order fi, a wave 
packet corresponding to a well-localized particle spreads so much in one 
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period that this type of description of the motion is not of physical 
interest. 


PROBLEMS 


1. Given three degenerate eigenfunctions that are linearly independent although 
not necessarily orthogonal, find three linear combinations of them that are orthog- 
onal to each other and are normalized. Are the three new combinations eigen- 
functions? If so, are they degenerate? 


2. Show that so far as the one-dimensional motion of a particle is concerned, the 
functions uz-(z) = 5(z — zx’) for all real x’ constitute a complete orthonormal set 
and that each of them is an eigenfunction of the position variable z with the eigen- 
value z’. Set up the position probability function and compare with that obtained 
in Sec. 7. 


3. If the potential energy V(x) in a one-dimensional problem is a monotonic 
increasing function of z and independent of the time, show that the functions uy.(z) = 
(dV /dz)-+,-2 8(z — x’) for all real z’, where V’ = V(z’), constitute a complete 
orthonormal set of eigenfunctions of the potential energy with eigenvalues V’. Find 
the probability function for the potential energy, and show that it has the properties 
that would be expected of it. 

4. What changes are needed in the discussion of the momentum eigenfunctions 
given in Sec. 11 if the normalization is carried through in a box of rectangular parallel- 
epiped shape rather than in a box of cubical shape? 

5. Find two other representations for the Dirac 5 function like that given in Eq. (11.9). 
6. Verify each of Eqs. (11.13) involving 6 functions. 

7. Show that the two Eqs. (11.20) are correct: that the momentum probability 
function defined in Eqs. (11.19) and (11.17) for a normalized y sums or integrates to 
unity. 

8 The expression in brackets in the integrand of Eq. (10.19) enables one to calculate 
y at time ?’ in terms of y at time ¢. If this expression is called 7G(z’,t’ ;z,t) in the one- 
dimensional case, then ¥(2’,t’) = ifG@(x’,t’;z,t)y(a,t) dz.. Show that for a free particle 
in one dimension 


™m 


4 
at, ee ae im (x'—2)2/2A(t'—t) 
Go(z',t j2,t) t | sae = 5 | eim(z'—z 
Assume that y has the form of the normalized minimum wave packet (12.11) at 
t = 0; use the above result to find y and |y|? at another time t’. This Gp is called the 


free-particle Green’s function in one dimension (see Sec. 36). 


9. Let u(r) and u2(r) be two eigenfunctions of the same hamiltonian that correspond 
to the same energy eigenvalue; they may be the same function, or they may be degen- 
erate. Show that 


ful (t)(xpe + pet)us(r) dr = 0 


where the momentum operator p, = —ih(d/dz) operates on everything to its right. 
What is the relation of this problem to Prob. 5, Chap. 2? 


4 . 
Discrete Eigenvalues: 
Bound States 


The formalism that was developed in Chap. 2 and elaborated in Chap. 
3 will now be applied to the explicit computation of discrete energy levels 
and the corresponding eigenfunctions. The next chapter will take up 
situations in which the energy eigenvalues are continuously distributed. 
Thus we are concerned here with bound states in which the particle is 
restrained by the external forces (potential energy) to a particular region 
of space, and in the next chapter with collision problems in which the 
particle can approach from and recede to infinite distance. 

The relatively few potential-energy functions V(r) for which analytic 
solutions of the wave equation (8.2) are possible are important beyond 
these immediate problems, since they often serve as bases for approxi- 
mate calculations on more complicated systems. 


13CLINEAR HARMONIC OSCILLATOR 


The one-dimensional motion of a point mass attracted to a fixed center 
by a force that is proportional to the displacement from that center 
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provides one of the fundamental problems of classical dynamics. Its 
study is important not only for itself but also because more complicated 
systems can always be analyzed in terms of normal modes of motion 
whenever the interparticle forces are linear functions of the relative dis- 
placements, and these normal modes are formally equivalent to harmonic 
oscillators. This linearity of the newtonian equations of motion means 
that the potential energy of the system is a bilinear function of the 
coordinates; since this aspect carries over into quantum mechanics, har- 
monic normal modes can again be found. Thus the one-dimensional 
harmonic oscillator is very important for the quantum-mechanical treat- 
ment of such problems as the vibrations of individual atoms in molecules 
and in crystals. It also provides the key to quantum theory of the 
electromagnetic field. The vibrations of the electromagnetic field in a 
cavity can be analyzed into harmonic normal modes, each of which has 
energy levels of the oscillator type [see Eq. (13.8)]. Excitation of a par- 
ticular mode of angular frequency w to the nth excited state means that 
the energy of the system is increased by nhw and corresponds physically 
to the addition of m photons of this frequency to the system (see Chap. 
14). 

We shall solve the oscillator problem in this section by starting with 
the Schrédinger equation. However it can also be solved by algebraic 
methods, starting with the matrix theory of Chap. 6. The importance of 
the harmonic oscillator is so great that we shall also carry through. the 
matrix solution in Sec. 25. 


ASYMPTOTIC BEHAVIOR 


The force F = —Kz can be represented by the potential energy 
V(x) = $Kz? so that Eq. (8.5) becomes 
h2 a2 
- on Is + 4Kz’u = Eu (13.1) 


It is convenient in dealing with an equation of this type to rewrite it in 
dimensionless form. To this end we introduce a dimensionless inde- 
pendent variable § = az, and a dimensionless eigenvalue \, and attempt 
to put Eq. (13.1) in the form 


d? 
ae +A — &)u=0 (13.2) 
Comparison of Eqs. (13.1) and (13.2) shows that this is possible if 
cee fee eee 
a 72 nN ANK a (13.3) 


where w, = (K/m)' is the angular frequency of the corresponding classical 
harmonic oscillator. | 
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The solution of Eq. (13.2) is facilitated by first examining the domi- 
nant behavior of u in the asymptotic region §— +.! For sufficiently 
large £ it is apparent that u(£) = ée+#® satisfies Eq. (13.2), so far as the 
leading terms (which are of order £u) are concerned, when n has any 
finite value. The boundary conditions of Sec. 8 permit us to keep only 
the minus sign in the exponent. This suggests that it might be possible 
to find an exact solution of Eq. (13.2) of the form 


u(é) = H(é)e (13.4) 


where H(£) is a polynomial of finite order in & Substitution of Eq. 
_ (18.4) into Eq. (13.2) gives as the equation for H(é) 


H"” — 2§H’+ A’—-— DA =0 (13.5) 
where primes denote differentiation with respect to é. 


ENERGY LEVELS 
We now find a solution for H in the form 


H(é) = &(ao + aié + aot? + --- ) ao ~ 0 s>0O_ (13.6) 


This is necessarily finite for £ = 0. Equation (13.5) is to be valid for all 
values of &, so that, when Eq. (13.6) is substituted into it, the coefficient 
of each power of £ can be equated to zero. 


s(s — l)ag = 0 
(s + l)sa, = 0 
(s + 2)(s + Daz. — (28+ 1 — A)ao = 
(s + 3)(s + 2)a3 — (28+ 3 —A)ar = 0 


© © © © © © © © © 8 © © © © © © © © © © © © © © © © © © © 8 © 


(s+v+ 2)(s +» 4+ laze — (28+ 2n +1 - dA)a, = 0 


where v is an integer. Since a» cannot be zero, the first of Eqs. (13.7) 
tells us that s = 0 or s = 1. The second equation tells us that s = 0, 
or a; = 0, or both. Then the third equation gives us a2 in terms of ao, 
the fourth gives us a3 in terms of a1, and the general equation gives us 
a,+2 in terms of a,. 

The discussion of parity in Sec. 9 shows that u(£), and hence also 
H(&), can be chosen to be either even or odd in é. It then follows from 
Eq. (13.6) that a; and all the other odd-subscript coefficients are zero. 
The wave function is then even or odd according as s = O ors = 1. 

It follows from Eqs. (13.7) that the presence in the series (13.6) of 
a finite or an infinite number of terms depends on the choice of s and the 


(13.7) 


1 We follow the polynomial method of A. Sommerfeld, ‘‘Wave Mechanics,” p. 11 
(Dutton, New York, 1929). 
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eigenvalue \. If the series does not terminate, its dominant asymptotic 
behavior can be inferred from the coefficients of its high terms:. 


a 2 
al a ee a 


a, ywoumyp 


This ratio is the same as that of the series for é"e#? with any finite value of 
n. Equation (13.4) shows that this behavior for H violates the boundary 
conditions on u for large |é|. 

Thus the series (13.6) must terminate. This means that 


A=2%+2v4+1 


y must be an even integer, since a) ~ 0 and otherwise the even-subscript 
terms would form an infinite series. The index s can still be either 0 or 
1, and corresponding to these two values ) is equal to 2y + 1 or 2y + 3, 
where » is an even integer. We may express both cases in terms of a 
quantum number n: 


h=2n+1 E, = (n+ d)ha-. n=0,1,2,... (13.8) 


It follows from Eqs. (13.8) and (13.7) that n is the highest value of 
s + y in the series (13.6) for H. If we denote the corresponding poly- 
nomial by H,(£), we see that H, is of degree n in & and is wholly even or 
odd according as n is even or odd. Since e~?#’ is even and has no nodes, 
the corresponding eigenfunction u,(¢) has the parity of n and has n nodes. 


ZERO-POINT ENERGY 


The infinite sequence of energy levels (13.8) has the equal spacing postu- 
lated in 1900 by Planck, which is in agreement with the quantization 
rules of the old quantum theory. However, the finite value of the ground- 
state energy level 3hw., which is called the zero-point energy, is character- 
istic of the quantum mechanics and is related to the uncertainty principle 
in the same manner as is the finite lowest energy level for the square 
well with perfectly rigid walls (Sec. 9). The total energy is of order 
(Ap)2/m + K(Az)?, where Ap and Az are measures of the spreads in 
momentum and position, as defined in Sec. 12; if this is minimized, taking 
account of the uncertainty relation (3.1), it is easily seen that the mini- 
mum Ap is of order. (Kmh?)*, so that the minimum total energy is of 
order A(K/m)? or hag. 


HERMITE POLYNOMIALS 


The polynomial of order n that has the parity of n and is a solution of 
Eq. (13.5) with AX = 2n + 1 


H’! — 2tH' + 2nH, =0 | (13.9) 
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is called the nth Hermite polynomial H,(t). It is clear from the fore- 
going solution of Eq. (13.5) that these conditions define H, uniquely 
except for an arbitrary multiplying constant. It is not necessary, then, 
to use the recursion relations (13.7) to study the detailed properties of 
the H, if some other formulation of them can be found that is consistent 
with these conditions. A far more convenient formulation is actually 
available, which expresses the H, in terms of a generating function S(£,s). 


S(é,8) = ef —(s—-8)? = em e*+2st 


oe 


>) Hn(€) (13.10) 


n! 
n=0 


If the exponential in Eq. (13.10) is expanded out in powers of s and £, it is 
seen that a given power of s is associated only with powers of é that are 
equal to that power or less than it by an even integer. Thus H,(£) 
defined in this way is a polynomial of order n that has the parity of n. 

To show that this H, satisfies the differential equation (13.9), we 
differentiate both sides of Eq. (13.10) first with respect to £ and then with 
respect to s. 


30s —et4oet » Qsrt} = > gs” ; 

OE => 2s8e +2ef — C n! H,(&) pay an n! HT, (&) 
as ~s4oe — Yo (28 + 28)8" 

ag 7 (Bet ape =) SEE BAe 


=) Go H,(é) 


Equating equal powers of s in the sums of these two equations gives, 
respectively, 


Hi. = 2nH n-1 
Any. = 2tH,, es 2nHn_1 


The lowest-order differential equation involving only H, that can be 
constructed from Eqs. (13.11) is easily seen to be Eq. (13.9). Thus the 
H,(&) given by Eq. (13.10) are the Hermite polynomials. 

The relations (13.11) may be used for the calculation of the H, and 
their derivatives, or an explicit expression obtainable directly from the 
generating function may be used. If S(é,s) is differentiated n times with 
respect to s and s is then set equal to 0, Eq. (13.10) shows that the result 
is simply H,(t). Now for any function of the form f(s — £) it is apparent 
that 


(13.11) 


DISCRETE EIGENVALUES: BOUND STATES 71 


Thus 


ans an an 
VY pt? —_ ep-(e-8)? = (—1)re% — e-(s-8)? 
asx” gn © (—1)reF oe 


This gives an expression for the nth Hermite polynomial: 
Ha(®) = (—Ureh Set (13.12) 


The first three polynomials calculated from Eq. (13.12) are 
Ho(t)=1 Hilt) = 2¢ H(t) = 48 — 2 


HARMONIC-OSCILLATOR WAVE FUNCTIONS 


The generating function is also useful for the calculation of integrals 
involving the harmonic-oscillator wave functions (13.4): 


un(z) = N,H,(azx)e*"* (13.13) 
Suppose, for example, that we wish to normalize u,(z); this is equivalent 
to choosing the constant N, such that 

re 2 feo 
/ lun(x)|? dx = ey H,2(&)e-” dé = 1 
— a —7 


The integral on the right can be expressed as a series coefficient in the 
expansion of an integral containing the product of two generating 
functions. 


Sp st 28k o—t2+2tko— Ee? = oy stim pe Z - —p2 
/. eet ete Hite et de 2. om 1S. H,(t)H»(é)e-® dé 


n\m! 
(13.14) 
The integral on the left of Eq. (13.14) is readily evaluated directly to give 


<1 (2Qst)” 
4p2st — wh ( 
riett = x > a (13.15) 
If equal powers of s and ¢ are equated in the series on the right sides of 
Eqs. (13.14) and (13.15), we obtain the results 
[2 He @er# de = ah2"n! 
e - (13.16) 
[2, Ha@Ha(fe-® dt = 0 n#zm 


The first of Eqs. (13.16) tells us that the normalizing constant can 
be chosen to be 


N, = (asa) (13.17) 


i 
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where a constant multiplicative complex phase factor of unit magnitude 
is still arbitrary. The second of these equations tells us that un(z) and 
Um(x) are orthogonal to each other if n ~ m; this is in agreement with the 
general result obtained in Sec. 10 for nondegenerate energy eigenfunc- 
tions, since, in accordance with Eq. (13.8), E, # En if n # m, and so 
there is no degeneracy. 

The integral 

© N. bd N. m [* — £2 
[2 ut @)2un(2) dz = 258 [HOH (De® dé 


is typical of others that can be evaluated with the help of the generating 
function. The two series expressions for the integral, 


L) eo 


° —s2+ 2st — i? — 22 a s™t™ bad — 22 
| i Pa 2 asa [7 , Ha) Hn(Qer® dé 
and 
2” (sr+lyn + gnint1) 
} 2st = at ie ey 
mi(s + fer 2. 7 
may be equated term by term. With the~help of Eq. (13.17), we get 
a (? as * m=nt+1 
. a 2 
L. u ()aUm(x) de = 1 6) See (13.18) 
2 


otherwise 


CORRESPONDENCE WITH CLASSICAL THEORY 


Plots of the first six harmonic-oscillator wave functions are shown in 
Fig. 10. It is apparent that the position probability densities |u,|? 
associated with these stationary wave functions have little resemblance 
to the corresponding densities for the classical harmonic oscillator; the 
latter are proportional to (£2 — &)-?, where & is the amplitude of the 
classical oscillator whose energy is equal to the quantum-mechanical 
eigenvalue. The agreement between classical and quantum probability 
densities improves rapidly with increasing n. Figure 11 contains a plot 
of |u,|? forn = (solid curve) and of the density of a classical oscillator 
of total energy ¥%*fiw, (dashed curve). The agreement is fairly good, on 
the average, the principal discrepancy being'the rapid oscillations in |u,|?. 

The expectation value for the potential energy can be obtained 
from Eq. (7.6): 


(Vn = [* uf (2)}Katug(2) de 


2n+1 


= 4K ya = a(n + pho. = BE 
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Fig. 10 Energy eigenfunctions for the first six states of the harmonic oscillator. 
[After L. Pauling and E. B. Wilson, Jr., ‘Introduction to Quantum Mechanics,” 
pp. 74-75 (McGraw-Hill, New York, 1935).} 


where fz?|u,|? dz can be calculated with the help of the generating func- 
tion in analogy with the evaluation of Eq. (13.18). Thus, for any value 
of n, the average potential and kinetic energies are each half of the total 
energy, just as is the case with the classical oscillator. 

In similar fashion it can be shown that (x) = (p) = 0 for any 
harmonic-oscillator wave function, so that Eq. (12.1) tells us that 
(Ax)? = (x*) and (Ap)? = (p?). It is then easy to see that the uncer- 
tainty product is 


Az: Ap = (n+ $)h 
This has the minimum possible value 3% of Eq. (12.7) for the ground- 
state eigenfunction 


4 
u(x) = = —Jatst (13.19) 


which, as would be expected, is of the form of the minimum packet 
(12.11). Thus the minimum packet happens to be an eigenfunction of 
the harmonic-oscillator wave equation if its Az is properly related to K 
and m. 
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Fig. 11 The position probability density for the state n = 10 of a harmonic 
oscillator (solid curve) and for a classical oscillator of the same total energy 
(dashed curve). [After L. Pauling and E. B. Wilson, Jr., “Introduction to Quantum 
Mechanics,” p. 76 (McGraw-Hill, New York, 1935).] 


OSCILLATING WAVE PACKET 
In accordance with Kq. (10.18), the general solution of the time-dependent 
Schrédinger equation for the harmonic oscillator 


. O h? 0? 
th at ¥(z,t) = (- i aa + Kz") y(a,t) 


can be expanded in terms of stationary wave functions 
V(xt) =) Antin(a)e*atl® = etn YA tg (a)eminas (13.20) 
n=0 n=0 


where the A, are arbitrary constants. Thus, apart from the phase 
factor et, y(2z,t) is a periodic function of t with the period of the 
classical oscillator 27/w,. This suggests that it might be possible to find 
a solution in the form of a wave packet whose center of gravity oscillates 
with the period of the classical motion.! 


1B. Schrédinger, Naturwiss. 14, 664 (1926). 
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To investigate this possibility, we assume that at ¢ = 0 the y of. 
Eq. (13.20) has the form of the normalized minimum packet (13.19), 
except that the center of gravity is displaced in the positive x direction 
by an amount a. 


= 4 
W(@,0) = )) Antin(a) = 5 entero (13.21) 
n=0 . 


We can make use of the orthonormality of the u, to calculate a particular. 
coefficient A, by multiplying Eq. (13.21) through by uX(z) and integrat- 
ing over 7. 


ee er 
An = i un(x)y(z,0) dx = a. H,,(£)e7t8e-4E-to)? de 
f) = aa 


The integral on the right can be evaluated with the help of the generating 
function by equating term by term the two series expressions for the 
integral: 


J 


ce) s” bad 
fs ens +2te—(E-kot hb) de = )} = i, Hy(f)ee HoH de 
i : 


and 
te—itott+st> — zhe—tto? (s&o)" 
We wie Py zy nl 
On making use of Eq. (13.17), we obtain 
_ Ege 7th? 
An = “Grniji (13.22) 


Substitution of these A, into Eq. (13.20) gives 


Baqi aaa Se | 
¥(z,t) = et tho —fiwct > Hat) (2 Ege et)” 


n=0 


3 : ; 
a . - : 
= = exp (-22 i. 1£,? os Lael —_ LEEW 2twet + EE pe.) 


ac at 1 2 
= — exp[—3(& — £0 cos cod) 
— i(kwet + Eo sin wet — 1£o? sin 2w-t)] 
where the sum is evaluated with the help of the generating function 


(13.10). The absolute square of this wave function gives a position 
probability density 


ly (x,t) |? = = —~ a? (2—a cos wt)? 
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This shows that ¥ represents a wave packet that oscillates without 
change of shape about x = 0, with amplitude a and the classical 
frequency. 

As a — 0, ¥ approaches the lowest energy eigenfunction wo(x)e—*e, 
The larger a becomes, the larger the number of stationary states that 
contribute significantly to the packet, and the larger the quantum num- 
ber mo for which A, of Eq. (13.22) has a maximum. For n> 1, we can 
use Stirling’s formula to maximize In A,; neglecting terms of order In n 
and lower 


In A, ~ n(In & — $1n 2) — an(Inn — 1) 


Ka? (13.23) 
tho = Ske? = 5 


Thus the energy level E,, = (mo + $)fw., from whose neighborhood most 
of the contribution to y comes, is approximately equal to the energy 
Ka? of the classical oscillator that has the same amplitude. 


14CJSPHERICALLY SYMMETRIC POTENTIALS IN THREE DIMENSIONS 


It is generally impossible to obtain analytic solutions of the three- 
dimensional wave equation (8.2) unless it can be separated into total 
differential equations in each of the three space coordinates. It has 
been shown! that there are 11 coordinates systems in which the free- 
particle wave equation [Eq. (8.2) with V = 0] can be separated. One 
of the most important of these is the spherical polar coordinate system, 
in terms of which the rectangular coordinates are given by (see Fig. 12) 


x =rsin 6cos ¢ 
y =rsin 6sin d 


z2=rcos 80 


If the potential energy is spherically symmetric, so that V(r) = V(r) isa 
function only of the magnitude r of r measured from some origin, the wave 
equation can always be separated in spherical coordinates. Many prob- 
lems of physical interest can be represented exactly or approximately 
in terms of spherically symmetric potentials of various shapes. In this 
section we effect the separation and solve the resulting total differential 
equations in 6 and ¢. The next two sections deal with the solution of 
the radial equation for particular forms of V(r). 


1L, P. Eisenhart, Phys. Rev. 45, 428 (1934). See also L. Pauling and E. B. Wilson, 
Jr., ‘Introduction to Quantum Mechanics,’”’ App. TV (McGraw-Hill, New York, 1935). 
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Fig. 12 The relation between rec- 
tangular and spherical polar coor- 
dinates of a point P. 


SEPARATION OF THE WAVE EQUATION 


The wave equation (8.2) with a spherically symmetric potential energy 
may be written in spherical coordinates: 


h? ne 7) : ee fis 3? 
~~ Im 4 5 acd sin 6 06 — r? sin? 6 ee 6 ag? i 
+ V(r)u = Eu (14.1) 
We first separate the radial and the angular parts by substituting 
u(r,0,p) = R(r) ¥ (6,9) 
into Eq. (14.1) and dividing through by w. 


1 1 Oo oY 1 o7Y 
aly y [amo 35 (sin? a) + mara aeF | eo) 


Since the left side of Eq. (14.2) depends only on 7, and the right side 
depends only on @ and ¢, both sides must be equal to a constant that 
we callA. Thus Eq. (14.2) gives us a radial equation 


1d/f. dR 2m han ON 7 
(7 9) + (F [E — V(r)] I k=0 (14.3) 
and an angular equation 


1 @ 1 
sn 8 96 (sin 7] 3a) + —— sin? 6 9 3a? aes ‘AY =0 (14.4) 
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The angular equation (14.4) can be further separated by substituting 
Y (6,6) = 0(6)®(¢) into it and following the same procedure to obtain 


2, 
“= + v6 =0 | (14.5) 
1 df. ,de 


The ¢ equation (14.5). can be solved at once; its general solution may be ~ 
written 

&(¢) = Aetrts + Bethe vy #0 

&(¢) = A+ Bo y=0 
The requirement of Sec. 8 that 6(¢) and d@/d¢ be continuous through- 


out the domain 0 to 27 of ¢ demands that v be.chosen equal to the square 
of an integer. We thus replace Eqs. (14.7) by 


Sn(p) = (2m)-teime . (14.8) 


where now all physical meaningful solutions are included if m is allowed 
to be a positive or negative integer or zero;! the multiplying constant is 
chosen equal to (27)—? in order that ® be normalized to unity over the 
range of ¢. 


(14.7) 


LEGENDRE POLYNOMIALS 


Unless V(r) is specified, the farthest we can carry our treatment is the 
solution of the @ equation (14.6), where now vy = m?. It is convenient to 
substitute w = cos 6 for @ and put 


6(8) = P(w) 
when Eq. (14.6) becomes 


o [a — w?) o | Ee ( = oP =0 (14.9) 


Since the domain of 6 is 0 to x, the domain of wis 1 to —1. The proce- 
dure for solving Eq. (14.9) is in many respects similar to the solution of 
the harmonic-oscillator wave equation presented in Sec. 13 and will not 
be given in detail here.2 Since Eq. (14.9) is a second-order differential 
equation, it has two linearly independent solutions. Except for particu- 
lar values of \, both of these are infinite at w = +1 and, in accordance 


1 At the very slight risk of confusion with the mass of the particle, we make use of 
the customary symbol m for the quantum number associated with the coordinate ¢. 
2 For a complete discussion of this equation, see E. T. Whittaker and G. N. Watson, 
“A Course of Modern Analysis,” 4th ed., chap. XV (Cambridge, London, 1935). 
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with Sec. 8, are not physically acceptable. If, however, \ = I(1 + 1), 
where 1 is a positive integer or zero, one of the solutions is finite at w = +1 
(the other is not); this finite solution has the form (1 — w?)!!™! times a 
polynomial of order / — |m| in w and has the parity of 1 — |m|. 

The physically acceptable solutions of Eq. (14.9) when m = 0 are 
called the Legendre polynomials P,(w). Just as is the case with the 
Hermite polynomials, their properties may be discussed in terms of a 
generating function 


T(w,s) = (1 — 2sw + s?)-3 


> P;(w)s! s<1 (14.10) 


t=0 


Differentiation of the generating function with respect to w and s leads to 
relations that are analogous to Eqs. (13.11) for the Hermite polynomials. 


(1 a w)P, —IlwP, + IP 11 
(l + 1)Piu1 _ (21 + 1)wP, = IPii1 


(14.11) 


where primes denote differentiation with respect to w. The lowest-order 
differential equation involving only P; that can be constructed from Eqs. 
(14.11) is easily seen to be Eq. (14.9) with \ = (1 + 1) and m = 0. 

For m not necessarily equal to zero, Eq. (14.9) has physically 
acceptable solutions if \ = U(1 + 1) and |m| <l. These solutions, which 
are called associated Legendre functions, are expressible in terms of the 
Legendre polynomials: 


dim| 
dw! 


Py(w) = (1 — w?)ilm Pi(w) (14.12) 
This can be shown by substitution of Eq. (14.12) into the equation that 
' is obtained by differentiating |m| times the equation for P;(w). The 
generating function for the associated Legendre functions-is obtained by 
differentiating Eq. (14.10) |m| times with respect to w and multiplying 
by (1 — w?) ill 

(2|m|)!(1 — w2)tlmlslo) 


T'n(w,s) = 2iml(|m|)!(1 — Qsw + s?)lm+4 


= ) Prw)s (14.13) 
L=|m| 
SPHERICAL HARMONICS 


The angular part Yin(0,¢) of the complete wave function, which is a 
solution of Eq. (14.4) when \ = l(1 + 1), is called a spherical harmonic. 
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It.is apparent that 
Yin(8,¢) = NimPi™(c0s 8) ®n(4) (14.14) 


where ,,(@) is given by Eq. (14.8), and Nin is the normalization constant 
for the associated Legendre function. 

The same proof that was given in Sec. 10 for the orthogonality of 
the energy eigenfunctions may be used to show that solutions of Eq. 
(14.4) corresponding to different eigenvalues \ or / are orthogonal.. The 
eigenvalue J is, however, (21 + 1)-fold degenerate, since there exist 
linearly independent solutions Yim(6,¢) for this value of / and all integer 
values of m between +1 and —l. The choice of Eq. (14.8) for ®n(¢) 
makes these degenerate eigenfunctions orthogonal. We have then that 
the integral . . . 


cg 2a 7 1 Qn _: 
JF LP V0.6) Vm (0,6) sin odode = f°, [P VEYem dw dg 


vanishes unless | = Il’ and m = m’. It is interesting to note that there 
is no more orthogonality present than is necessary to make this integral 
vanish when it should. Thus the ¢ part of the integral vanishes when 
m # m’ without regard for the / values; the @ or w part of the integral 
vanishes only when | ¥ l' and |m| = |m’|, since for m # m’ the ortho- 
gonality is taken care of by the integration over ¢. 

The integral , 


a Pr(w)Py™(w) dw (14.15) 


can be evaluated in various ways, for example, by using the generating 
function (14.13) in a manner similar to that described in Sec. 13. As 
expected, the integral (14.15) vanishes unless J = I’, when it has the 
value (2/(21 + 1)][(l + |m])!/(1 — |m])!]; thus Nim is the reciprocal of the 
square root of this quantity, except for an arbitrary complex phase factor 
of unit magnitude. For later convenience we choose the spherical har- 
monics to be 

Yin(0,o) = € kon earl P,™(cos 0)e' (14.16) 
wheree = (—1)™form > Oande = lform <0. The first few spherical 
harmonics are 


1 3\h 
Yoo = Ga) Yio = (=) cos 0 
; ; 
Yee > (=) sin 6 e+ Yoo = (s5:) (3 cos? @ — 1) 


ie ; 15\} . 
Yost = + (2) sin 6 cos 6 e+ Yous = (ge) sin? @ e£2% 
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PARITY 


The concept of parity introduced in Sec. 9 can now be extended to three- 
dimensional problems of the type discussed in this section. Suppose that 
the position coordinate r is reflected through the origin so that r is replaced 
by —r; this corresponds to replacing x by —z, y by —y, and z by —z, or 
to replacing 6 by 7 — 6, @ by ¢ +7, and leaving r unchanged. It is 
clear that the only change in the wave equation (14.1) is that u(r,6,¢) is 
replaced by u(r, r — 0, @ + 7), the rest of the equation being unaffected. 
Then the discussion of Sec. 9 shows that orthogonal linear combinations 
of degenerate eigenfunctions can be found that have definite parities and 
that a nondegenerate eigenfunction must have a definite parity. 

The energy levels for a spherically symmetric potential are degener- 
ate at least with respect to the quantum number m, for 1 > 0. In this 
case, the degenerate eigenfunctions all have the same parity, which we 
now show to be the parity of /. When r is reflected through the origin, 
the radial part R(r) of the solution is unchanged, the ¢ part @(¢) given 
by Eq. (14.8) has the parity of |m|, and the @ part P,(cos @) has the 
parity of 1 — |m|, since P;"(w) is equal to an even part (1 — w?)!!"! times 
a polynomial in w that has the parity of 1 — |m| with respect to change 
in sign of w or cos 6. Thus Yin(9,¢), and hence u(r), has the parity of J. 


ANGULAR MOMENTUM 


The radial wave equation (14.3) may be rewritten in a form that resem- 
bles the one-dimensional wave equation (8.5). If we put R (r) = x(r)/r, 
the equation for the modified radial wave function x may be written 


h? dx U+ DAP] ) 
Fg ae + [ve + | x = Ex (14.17) 


Thus the radial motion is similar to the one-dimensional motion of a 
particle in a potential 


VO oe 


ion (14.18) 


The additional “potential energy” can be seen physically to be connected 
with the angular momentum in the following way. A classical particle 
that has angular momentum L about the axis through the origin per- 
pendicular to the plane of its orbit has the angular velocity w = L/mr? 
when its radial distance from the origin is r. An inward force 


2 


is required to keep the particle in this path; this “centripetal force” is 
supplied by the potential energy and hence adds to the V(r) that appears 
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in the radial motion an additional ‘centrifugal potential energy” L?/2mr?. 
This has exactly the form of the extra term in (14.18) if we put 


L = (li + 1% 


The foregoing physical argument for identifying the quantum 
number 1 with the angular momentum of the particle can be put in 
quantitative form by finding the operators that correspond to the three 
components of the angular-momentum vector. Classically, we have that 
L =r p, so that in quantum mechanics we take 


Le = YP: — Py = -in(u gs age.) 


Oz oy 
b 0 re] 
Ly = 2pz — XP2 = —th (2 Seo ae (14.19) 


: re) re) 
L, = py — yp: = —th («3 = A) 


Equations (14.19) can be transformed into spherical polar coordinates to 
give 


F : re) 0 
1, = ih (sin 6 3, + cot # cos #35) 
Lomas © ot pain ge (14.20) 
y= COs } a, c a6 ? 
. O 
L,= =U 


The operator that represents the square of the total angular momentum 
is then found from Eqs. (14.20) to be ; 


L? = L,? + dig? + Le 
‘ 1 @ : 0 1 o? 
nee | anne tei ee ee Ass 
: E 6 36 (sin 950) + sin®6 | ay) 
Comparison of Eqs. (14.21) and (14.4) shows that Yim(0,@) is an eigen- 
function of L? with the eigenvalue I(/ + 1)h?: 
L?¥im(0,¢) = Ul + 1)h?¥im(0,9) (14.22) 


In similar fashion, it follows from the structure of Eq. (14.8) and the last 
of Eqs. (14:20) that &,,(¢), and hence also Yin(6,$), iS an eigenfunction 
of L, with the eigenvalue mh: 


LY in(0,¢) = MhYin(0,%) (14.23) 


Thus the separation of the wave equation in spherical polar coordi- 
nates results in wave functions that are eigenfunctions of both the total 
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angular momentum and the component of angular momentum along the 
polar axis. The quantum number / that appears in Eq. (14.22) is called 
the azimuthal or orbital-angular-momentum quantum number. The quan- 
tum number m that appears in Eq. (14.23) is called the magnetic quantum 
number, since it is of importance in the theory of the Zeeman effect (see 
Sec. 31), which involves the component of angular momentum along the 
magnetic field (z axis). It should be noted that in general the wave 
equation cannot be separated in this way and angular-momentum eigen- 
functions obtained if the potential energy V(r) is not spherically sym- 
metric. This corresponds to the classical result that the angular momen- 
tum is a constant of the motion only for a central field of force (which is 
describable by a spherically symmetric potential). There is, however, 
the characteristic difference between classical and quantum theory that 
all three components of L can be precisely specified at once in the classical 
theory, whereas only L, and L? can, in general, be precisely specified at 
once in the quantum theory, since Y,,(0,¢) is not an eigenfunction of 
L, and L, (except for the case 1 = 0). The choice of the direction of the 
polar axis that distinguishes L, from L, and L, is, of course, completely 
arbitrary; it corresponds to the arbitrariness of the direction of space 
quantization in the absence of external fields in the old quantum theory. 


15LITHREE-DIMENSIONAL SQUARE WELL POTENTIAL 


We are now in a position to find the bound-state energy levels that 
correspond to particular choices of the potential energy V(r) and of the 
angular-momentum quantum number J, by solving the radial wave equa- 
tion (14.3). As a first example, we consider the square well potential of 
finite depth, for which V(r) = —Vo, r < a, V(r) = 0, r > a, where Vo 
is positive (see Fig. 13). A spherical region of this type in which the 
potential is less than that of the surroundings serves to attract a particle 
just as in the one-dimensional case considered in Sec. 9. 


V(r) 


Fig. 13 A spherically symmetric square well 
potential of depth Vo and radius a. 
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ZERO ANGULAR MOMENTUM 


When | = 0, it is easier to solve the wave equation in the form (14.17) 
than in the form (14.3). In this case, R(r) = x(r)/r, and the wave 
equation is 


— 5 ge — Vox = Ex r<a 
- 3 15.1) | 
i (15.1) 
“Rages ee 


The solution of Eqs. (15.1) is the same as that obtained for the finite 
potential step in Sec. 9, except for three points: First, the energy scale is 
lowered everywhere in the present problem by an amount Vo; second, the 
domain of r is 0 to +, in place of the domain — © to + for z; and 
third, the boundary condition that the wave function not become infinite 
at x = —o is now replaced by the same condition at r = 0. 

From the discussion of Sec. 9, it is apparent that the solutions of 
Eqs. (15.1) are 


x(r) = Asinar + Beosar 


a= eel Tr < a 
h (15.2) 
: 3 
x(r) = Ce-#r B= CH | ; r>a 


where we are interested in bound-state energy levels for which E < 0. 
The requirement that R(r) be finite at r = 0 demands that we set B = 0 
in the first of Eqs. (15.2). Thus the solution has the form of the odd 
parity solution of the one-dimensional problem. The energy levels are 
obtained by equating the two values of (1/x)(dx/dr) at r = a [this is 
equivalent to making (1/R)(dR/dr) continuous there] and are given by 
solving 


acotaa = —s (15.3) 


which is the same as Eq. (9.6). Then it follows from the discussion of 
Fig. 9 that there is no energy level unless Voa? > 1?h?/8m; there is one 
bound state if 1?h?/8m < Voa? < 9rh?/8m, etc. 


INTERIOR SOLUTIONS FOR ARBITRARY / 


For nonzero values of I, it is more convenient to work with the original 

radial equation (14.3) than with the equation for x. If we put p = ar, 

where a@ is defined in Eq. (15.2), the wave equation for r < a becomes 
2 

@R , 2dk [2-22] 2-0 


‘dp 2 op - 


(15.4) 
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The strong resemblance between Eq. (15.4) and Bessel’s equation sug- 
gests that R(r) can be expressed in terms of Bessel functions. This is, in 
fact, the case; if we define the “spherical Bessel function” j:(p) that is 
regular at p = 0 by! 


jile) = (5) S144 (0) (15.5) 


where J is an ordinary Bessel function of half-odd-integer order, it is 
easily verified that 7:(p) satisfies Eq. (15.4). In similar fashion, the 
“spherical Neumann function’’ is 


rule) = (—a( F140 


It can be shown? that Ji,;(p), where | is a positive or negative 
integer or zero, is expressible as a sum of products of sin p and cos p with 
polynomials of odd order in p~?. In particular, explicit expressions for 
the first three j’s and n’s are 


joe) = —* no(o) = ——* 
P inp. c¢ | si 
jal) = SP — SEP alo) = — SP 
(15.6) 
j2(e) = (3 — -) sin p — Lao p 
p> p p? 
iG) =— (3 _ 5) e08 ne 
p> p p? 
The leading terms for small p are* 
(21 — 1)! 


RS aie tee mee rae 


1 This definition and the properties of the 7; and m are taken from P. M. Morse 
and K. U. Ingard, “Theoretical Acoustics,” pp. 337-338 (McGraw-Hill, New York, 
1968). 


2G. N. Watson, “‘Theory of Bessel Functions,’ 2d ed., p. 52 (Macmillan, New York, 
1945). 


3 Equations (15.7) are useful scncosmnatinis for p? somewhat less than 41 + 6 and 2, 
respectively (Watson, op. cit., p. 44). 
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and the leading terms in the asymptotic expansions are! 


Jule) > = 008 [p — 30+ Le] 


; (15.8) 
n(p) —— - sin [p — $(1 + 1)z] 
pe p 
Some properties of the j’s and n’s are 
Ji0?(0)p? dp = 2p*[Jo?(o) + no(o)j1(0)] 
Jno?(p)p? dp = 2p*[n0"(p) — jo(e)mi(p)] 
‘ ‘ 1 , 
m_1(e)ji(o) — m(p)ji-1(o) = of L>0 (15.9) 
‘ d d . 1 
jilp) dp ™) — ni(p) ap 2) re pe 
The following are properties of both the j’s and the n’s: 
F j 2i+1. 
ji-r(e) + jrsi(o) = = jie) L>0 
4 ie) = sa Wile) — + Dinalo)] 
dp! ~ +1 Ji-1\P 1414p 
: I+1. 
= jir(e) — : jie) 
5, lol*ti(o)] = oslo) 1>0 (15.10) 
© tp-tu(o)] = — pHs) 
dp t / +1 
Si1(e) dp = —jo(e) 
Sj0(0)p? dp = p7j1(p) 
Si? (p)p? do = 407[92?(0) — jr—r(0)J14-1(0)] L>0 
Since R(r) must be finite for r = 0, the desired solution for r < a is 
R(r) = Aji(ar) . (15.11) 


EXTERIOR SOLUTIONS FOR ARBITRARY | 


The wave equation for r > a can be put in the form (15.4) if we redefine 
p to be 78r, where 8 is given in Eq. (15.2). Since the domain of p does not 


1 Equations (15.8) are useful approximations for p somewhat larger than 3I(/ + 1) 
(Watson, op. cit., p. 199); however, the magnitudes (although not the phases) of ji 
and n; are given to good approximation by (15.8) if p is somewhat larger than l, which 
is approximately the value of p for which the magnitude of j; is greatest. 
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now extend in to zero, there is no reason why m cannot appear in the 
solution: The linear combination of j; and 7 to be selected will be deter- 
mined by the asymptotic form, which must fall off exponentially for 
large r. This suggests that we define spherical Hankel functions 


hi (p) = jrle) + ini(p) 


: ; (15.12) 
— hi (p) = jrle) — imp) 
which from Eqs. (15.8) have the asymptotic forms 
hy (p) >} eftedatne) 
p>™ p 
(15.13) 


hy) (p) er. entle—40 4D] 
po p 


It can be shown that the asymptotic expansions, of which Eqs. (15.13) 
_ give the leading terms, contain no terms for which the exponent has the 
opposite sign to that given. 

The desired solution for r > a is then 


R(r) = Bhi (pr) = Bljibr) + in(2Br)] (15.14) 
The first three of these functions are 
1 
(D (7, a Ee 
ho (78r) Br B 
mse) = if +h) esr (15.15) 
: Br = Pr? 


ha (¢Br) = €; + se : \ew 


Br B 2p 2 B r3 
ENERGY LEVELS 


The energy levels are obtained by requiring that (1/R)(dR/dr) be con- 
tinuous at r = a. When this condition is applied to the interior solution 
(15.11) and the exterior solution (15.15) with 1 = 0, we obtain Eq. (15.3). 
This may be written as 


2m V ,a? 
h? 


EcotE = —7 f+ 9? = (15.16) 
where, as in Sec. 9, we have put = aa and 7 = Ba. The same condi- - 
tion applied to the solutions for 1 = 1 reduces, with the help of Eqs. 
(15.6) and (15.15), to 


t 1 1 2mV a? 
ae Pee aee eee 2472 = a 
n. 7 h 


(15.17) 


~ 
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Equations (15.17) may be solved numerically or graphically, by the 
methods indicated for the solution of Eqs. (15.16) in Sec. 9. In general, 
there is no degeneracy between the eigenvalues obtained from the solution 
of equations like (15.16) and (15.17) for various values of 1. 

It is easy to see how many energy levels Eqs. (15.17) give for various _ 
values of Voa? without going through the numerical work. A new level 
appears whenever 7 is zero or cot é is infinite. This occurs at ¢ = 7, 
2x, .... Thus there is no energy level with 1 = 1 when 


22 
2 us 
Vea? < 2m 


there is one bound state with 1 = 1 if +h?/2m < Voa? < 2n°h2/m, ete. 

The smallest value of Voa? for which there exists a bound state with 
L = 1 is greater than the corresponding value of Voa? for 1 = 0; this is 
reasonable from a physical point of view. The interpretation in Sec. 14 
of the / term in the radial wave equation as an additional potential energy, 
which corresponds to the repulsive “centrifugal force,” suggests that a 
particle possessing angular momentum requires a stronger attractive 
potential to bind it than a particle with no angular momentum. Indeed, 
it turns out that the minimum square well potential “strength” Va? 
required to bind a particle of orbital-angular-momentum quantum num- 
ber / increases monotonically with increasing 1.1 


16LITHE HYDROGEN ATOM. 


The potential energy V(r) = —Ze?/r, which represents the attractive 
coulomb interaction between an atomic nucleus of charge +Ze and an 
electron of charge —e, provides another wave equation that can be 
solved analytically. This problem is of direct physical interest, since 
apart from relativistic effects (see Chap. 13), the calculated energy 
eigenvalues are in agreement with the observed energy levels of the 
hydrogen atom (Z = 1), the singly charged helium ion (Z = 2), ete. 


REDUCED MASS 


The Schrédinger wave equation developed in Sec. 6 describes the motion 
of a single particle in an external field of force. Now, however, we are 
interested in the motion of two particles (nucleus and electron) that are 
attracted to each other by a force that depends only on the distance 
between them. The form of the wave equation to be used for two parti- 
cles is suggested by the extension of the wave equation from one to three 


1 It can be shown that bound states appear with zero energy for a particular / value 
when [(1/R)(dR/dr))-.c = —(l + 1)/a; for 1 > 0 this is equivalent to the condition 
ji-i(€) = O where now & = (2mVoa?/h?)}. 
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dimensions that was given in Sec. 6. This extension involved making 
the wave function depend on the three rectangular coordinates x, y, and 
z, instead of just on x, and introducing the momenta corresponding to 
the new coordinates as they appear in the classical expression for the - 
energy. 

A similar extension from three to six rectangular coordinates leads 
directly to the Schrédinger wave equation for two particles of masses 
m, and me: 


.» O h? o? 3? o? 
th Fy; V(©1,Y1,21,02,Y2,22,t) = [= Oni (a aye + =) 
h? 0? 0? o? 
a 2m? (e: a OY2? - 3) 


+ V (21,4 1,21,%2,Y2,22,t) V(21,Y1,21,L2,Y2,22,t) (16.1) 


where the potential energy is assumed to depend in an arbitrary manner 
on all six coordinates and the time. If now the potential energy depends 
only on the relative coordinates, so that V = V(a1 — 22, y1 — Y2, 21 — 22); 
an important simplification can be made. We define relative coordinates 
zx, y, 2 and coordinates of the center of mass X, Y, Z by 


t+ 1y— Xe Y= Y1 — Y2 Z2= 21 — 22 


MX = M121 + MoXLe MY = M1Yy1 + MeoYe2 MZ 


M321 + Mez2 
(16.2) 


Here, M = m, + m; is the total mass of the system. Equation (16.1) 
can be rewritten in terms of the new coordinates . 


, _[ Ara , a , a) 
ag =| au (or + ars + az) 


BEE his gO Vs easy |e 63) 
~ Qu \an? © ay? * 2? sh de ; 


where 


me MyM ; 
alana Pass (16.4) 


is called the reduced mass. 
Two separations of the wave Sdduticn (16.3) can now be made. 
First, the time dependence can be separated out, as in Sec. 8, and, second, 


a separation can be made into a product of functions of the relative 
coordinates and center-of-mass coordinates. The process is straight- 
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forward and simple and results in 


¥(x,y,2,X,Y,Z,t) = u(x,y,2z)U(X,Y,Z)e Haren 


2 
7 5 Vu + Vu = Bu (16.5) 
PRU eau 
om YU = 


where the V? operators in the second and third equations imply differ- 
entiation with respect to the relative and center-of-mass coordinates, 
respectively. The second of Eqs. (16.5) describes the relative motion of 
the two particles and is the same as the equation for the motion of a 
particle that has the reduced mass uz in an external potential energy V. 
The third of Eqs. (16.5) tells us that the center of mass of the system of 
two particles moves like a free particle of mass M. 

In the hydrogen-atom problem, we shall be interested in the energy 
levels EH associated with the relative motion. In this case, the reduced 
mass y is only slightly smaller than the electronic mass, since atomic 
nuclei are far more massive than electrons. 


ASYMPTOTIC BEHAVIOR 


The separation of the relative motion in spherical coordinates is made 
as in Sec. 14. The radial equation that corresponds to the angular- 
momentum quantum number / is then 


he 1 d G an) Ze 4 Wt DM 


-pag("G)-=Rt R=ER (16.6) 


r 
where £ < 0 for a bound state.!. We follow the polynomial method used 
in the treatment of the harmonic-oscillator equation that was given in 
Sec. 13, and we first attempt to rewrite Eq. (16.6) in dimensionless form 
by introducing a dimensionless independent variable p = ar. Unlike Eq. 
(13.1), however, where the leading term for large x was the potential- 
energy term 3Kz2x?, the leading term in Eq. (16.6) for large r is the eigen- 
value term EH. We therefore choose a so that the E term becomes a 
fixed number; this makes the dominant asymptotic behavior of the solu- 
tion independent of the eigenvalue. We rewrite Eq. (16.6) as 


1d/.,dk 1 i+) = 
where the particular choice of the number 7 for the eigenvalue term is 


arbitrary but convenient for the following development. Comparison 


1E. Schrédinger, Ann. Physik 79, 361 (1926). 
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of Eqs. (16.6) and (16.7) shows that 


; 8u| E| 2uZe? Ze? bu 3 ; 
2 = =_ OO =_lUC OO —_——, 
a ae r oa? " (ats) (16.8) 


As with the harmonic-oscillator equation, we first find the dominant 
behavior of R(p) in the asymptotic region p— ©. Tor sufficiently 
large p, it is apparent that R(p) = p*e* satisfies Eq. (16.7) so far as the 
leading terms (which are of order R) are concerned, when n has any 
finite value. This suggests that we look for an exact solution of Eq. 
(16.7) of the form 


R(p) = F(p)e (16.9) 


where F(p) is a polynomial of finite order in p. Substitution of Eq. 
(16.9) into Eq. (16.7) gives as the equation for F(p) 


Rey 6 # 1) F! + lene x ae F =0 (16.10) 
p p p 
where primes denote differentiation with respect to p. 


ENERGY LEVELS 


We now find a solution for F in the form 


F(p) = p*(ado + Qip + aoxp? +--+: +) ; 
= p*L(p) ao # 0 s > 0 (16.11) 


This is necessarily finite for p = 0. Substitution of Eq. (16.11) into Eq. 
(16.10) gives as the equation for L 


pL" + p[2(s + 1) — plL’ + [fo — 8 — 1) | 
+s(s4+ 1) -1l4+1))]L =0 


If p is set equal to zero in this equation, it follows from the form of D 
implied by Eq. (16.11) that s(s + 1) —Ui+1) =0. This quadratic 
equation in s has two roots: s = 1 and s = —(1+ 1). The boundary 
condition that R(p) be finite at p = 0 requires that we choose s = l. 
The equation for L then becomes 


pL" + [20+ 1) — pl’ +A —1— LE =0 (16.12) 


Equation (16.12) can be solved by substituting in a power series of 
the form indicated by Eq. (16.11). The recursion relation between the 
coefficients of successive terms of the series is readily seen to be 


yvtl+1—A27 


Qi. = G+ DG PoP) ay (16.13) 
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If the series does not terminate, its dominant asymptotic behavior can 
be inferred from the coefficients of its high terms: 


Qy41 1 
p+ 


ay woe yp 


This ratio is the same as that of the series for p"e* with any finite value of 
n. Equations (16.9) and (16.11) show that this behavior for L violates 
the boundary condition on # for large p. 

Thus the series for L must terminate. If the highest power of p in 
L is p*’(n’ > 0), we must choose equal to a positive integer n,! such 
that 

A=n=n'+1+1 (16.14) 


n’ is called the radial quantum number and n the total quantum number. 
Since n’ and / can take on positive integer or zero values, n can have the 


values 1, 2,.... The energy eigenvalues are given by Eq. (16.8) 
— _ig| a. — ware 
E, = |E,| = in? (16.15) 


in agreement with the old quantum theory and with experiment. Unlike 
the square well potential problem considered in Sec. 15, the coulomb 
field problem gives rise to an infinite number of discrete energy levels 
extending from —yZ%e4/2h? up to zero, for any finite value of Z. This 
is due to the slow decrease in magnitude of the coulomb potential at 
large r. 


LAGUERRE POLYNOMIALS 


The physically acceptable solutions of Eq. (16.12) with \ = n may be 
expressed in terms of the Laguerre polynomials L,(p), which can be defined 
in terms of a generating function 


e 8/(1—a) 


a eae BG 
-) Lal) 54 e<l (16.16) 
q=0 q 


Differentiation of the generating function with respect to p and s leads to 
relations that are analogous to Eqs. (13.11) for the Hermite polynomials 
and (14.11) for the Legendre polynomials: 


Ly ~ Wy = —Qgr 
Devi = (2q oe aoe p) Lg — qa 


1 The result that the allowed values of ) are integers, rather than multiples of integers, 
derives from the choice of } for the eigenvalue term in the dimensionless radial wave 
equation (16.7). 


(16.17) 
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The lowest-order differential equation involving only L, that can be 
constructed from Kgs. (16.17) is easily seen to be 
pLy + (1 — p)Ly + qh, = 0 (16.18) 


Equation (16.18) resembles Eq. (16.12) but is not quite the same. We 
define the associated Laguerre polynomial 


Li (e) = = p Ea(e) (16.19) 


On differentiating Eq. (16.18) p times, it is seen that L,?(p) satisfies 
pL?” + (p +1 — p)L?’ + (q — p)L,? = 0 (16.20) 


Comparison of Eq. (16.12) with X = n and Eq. (16.20) shows that the 
desired polynomial solutions are the associated Laguerre polynomials 
Lit} (e), which are of order (n + 1) — (21 + 1) = n — 1 — 1in agreement 
with Eq. (16.14). 

Differentiation of Eq. (16.16) p times with respect to p gives the 
generating function for the associated Laguerre polynomials: 


,  (—s8)Penes/A-) Lg? (p) : 
Unlors) = payer > 2 “es (16.21) 


The following explicit expression may be verified by substituting it into 
Kq. (16.21), with n + 1 = qand 21+ 1 = p, and interchanging the order 
of the two summations: 


2141 k+2l+1 [(n + D)!]?p* 
LisH) = > (—1)t iar £1 + Be 
(16.22) 


HYDROGEN-ATOM WAVE FUNCTIONS 


The radial wave function is of the form e~#’p'L74(9). The normalization 
constant may be found by using the generating function to evaluate 
the integral 


[> ero LLzA(o) to" dp = I ET (16.23) 


Thus the normalized energy eigenfunctions for the hydrogen atom are 


Unim(7,0,) = Rru(r) Yim(0,¢) 


Bat) = = {(7) Sate eis] PEO (16.24) 
n? QZ, | 


ay = — 5 p=—r 
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where Yim(8,) is the normalized spherical harmonic given in Eq. (14.16); 
a is the radius of the first (circular) Bohr orbit for hydrogen (Z = 1) 
in the old quantum theory. The energy levels (16.15) may be written 
Ze? . 

ayn? 


The first three radial functions, which are found from Eqs. (16.22) 
and (16.24), are 


Ruo(r) = (2) Qe~arla 


ao 


Z\i Z 
R2a(r) = mm) (2 — 2) e—2t/2aq 


i 
Rar) = (s:.) ee e—21/2a9 


A much more complete set of these functions, with graphs of some of 
them, is given by Pauling and Wilson.? 

It is interesting to note that each of the eigenfunctions for which 
l = 0 has a discontinuous gradient at r = 0, since dRy»o/dr ~ 0 there 
and Yoo is independent of and ¢. This is a consequence of the infinite 
potential energy at that point, as can be shown by means of a limiting 
process similar to that used in Sec. 8 to derive the boundary conditions 
at a perfectly rigid wall. 


E, = — 


DEGENERACY © 


The energy eigenvalues (16.15) depend only on 7m and so are degenerate 
with respect to both! and m. Thus, for each value of n, can vary from 
0 to n — 1, and for each of these / values, m can vary from —I to +. 
The total degeneracy of the energy level E, is then 

n~-1 

y (21+1) =2 ee 


i=0 


+n =n’? 


It follows from the discussion of Sec. 14 that the degeneracy with 
respect to m is characteristic of any central force field, for which V 
depends only on the radial distance r from some point. The / degener- 
acy, however, is characteristic of the coulomb field, as distinguished from 
most other central force fields.2, In some problems, such as the motion of 
the valence electron of an alkali atom, the potential energy of the electron 


1 Pauling and Wilson, op. cit., sec. 21. 


2 The special symmetry of the coulomb field that leads to this degeneracy is discussed 
in Sec. 30. 
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is central but only approximately of the coulomb form. This prevents 
the n energy levels that have the same total quantum number n and 
. different 1 from being coincident, so that the nth hydrogen-like level 
splits up into n distinct levels. If also some external field (such as a 
magnetic field) that destroys the spherical symmetry is imposed, the 
(21 + 1)-fold m degeneracy disappears, and the nth hydrogen-like level 
is split up into n? distinct levels. 

The existence of degenerate energy eigenvalues means that linear 
combinations of the corresponding eigenfunctions are solutions of the 
wave equation with the same energy. In the case of the m degeneracy, 
such linear combinations of the spherical harmonics Y7,,(6,¢) can be 
found that correspond to a new choice of the polar axis. It is reasonable 
to expect that linear combinations of the degenerate hydrogen-atom 
eigenfunctions that have the same n and different / exist that also corre- 
spond to some new choice of the coordinates. This is, in fact, the case, 
since it turns out that the hydrogen-atom wave equation can be separated 
in parabolic coordinates. In general, degeneracy will occur whenever 
the wave equation can be solved in more than one-way (in different 
- coordinate systems, or in a single coordinate system oriented in dif- 
ferent ways), since if there were no degeneracy the wave functions 
obtained in the different coordinate systems would have to be identical 
except for a multiplying constant, and that is usually not possible. For 
a general central field, an exception occurs when / = 0, since then the 
wave function is spherically symmetric and has the same form for all 
orientations of the polar axis, so that there is no degeneracy. A similar 
exception occurs in the hydrogen-atom problem when n = 1, in which 
case it turns out that the solutions obtained by spherical and parabolic 
separation of the wave equation are identical. 


SEPARATION IN PARABOLIC COORDINATES 


The parabolic coordinates £, 7, ¢ are given in terms of the spherical 
polar coordinates by the relations 


t=r—z=r(l — cos 6) 

n=r+z=r(1 + cos 6) (16.25) 

¢?=¢ 
The surfaces of constant ¢ are a set of confocal paraboloids of revolution 
about the z or polar axis, with focus at the origin, that open in the direc- 
tion of positive z or @ = 0. The surfaces of constant 7 are a similar set 
of confocal paraboloids that open in the direction of negative z or @ = 7. 


The surfaces of constant ¢ are the same as in the spherical coordinate 
system: planes through the polar axis. 


lt 
I 
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The wave equation for the hydrogen atom in parabolic coordinates 


~ 35 lea Lae (#5) + a (>) 


1 0’u)  2Ze? 
-~ “at u=E E 
+234 Ppt = Bu <0 (16.26) 


The separation is accomplished by substituting 


u(én,d) = f(E)g(n)®(4) 


into Eq. (16.26) and dividing ee by u; the @ watt of the equation 
separates at once!: 


se (1862) 
f dg g dyn \" dn 
4uZe*tin Ss Qu|E|En 7 _id 

TRE Fn) a Badge 06-27 
Since the left side of Eq. (16.27) depends only on £ and 7, and the right 
side only on ¢, both sides must be equal to a constant that we call m?; in 
accordance with the discussion of Sec. 14, this gives normalized ¢ solu- 
tions that are the same as (14.8): 


@n(p) = (Qr)—teime m=0,+1,+2,... (16.28) 


The rest of Eq. (16.27) can be separated into ¢ and-y parts: 
he df\ m ul E| we 
fae (: z) ag ~ one & + 
1 d dg m? plE 
7 a £( #) ~ fy seed "| =» (16.29) 


where the separation constant v is to be determined by the boundary 
conditions. Thus the equations for f and g are 


df m? , wlElé oe 2 
ae (ae) — (Ge + ae — Ge +) = 9 


ulE|n _ 
in (nds) ~ (Gp + Se Jono 


Since these two equations are of the same form and differ only in their 
constant terms, it is sufficient to solve one of them. 


(16.30) 


1K. Schrédinger, Ann. Physik 80, 437 (1926); P. S. Epstein, Phys. Rev. 28, 695 (1926); 
I. Waller, Z. Physik 38, 635 (1926). 
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ENERGY LEVELS . 


The first of Eqs. (16.30) may be solved by the method used to-solve 
ee The substitution ¢ = af puts it into the dimensionless form 


of M1 1 m? 
Sf oo Se Ne a 16.31 
if we choose the parameters @ and ); to be given by 
Qu|E 1 (uZe? 
a = ay | AL = 2(45 = ’) (16.32) 


The second of Eqs. (16.30) is also of the form (16.31) if we put ¢ = ay 
with @ given by (16.32); Ai is replaced by 


Ae = (16.33) 

We now treat Eq. (16.31) as we did (16.7). The asymptotic 
behavior is dominated by the factor e+#*, where we must take the minus 
sign in the exponent. The series that multiplies this starts with a term 
¢*, where it is readily shown that s = +4m. We therefore substitute 


f(s) = eHetelL(y) (16.34) 
into (16.31) and obtain as the equation for L 
SL!" + (|m| +1 — 9)L’ + [a ~ 4(\m| + DIL = 0 A835) 


As with Eq. (16.12), the nonterminating solutions for L cause the wave 
function (16.34) to become infinite for large ¢. The terminating solu- 
tions are the associated Laguerre Poynorlals comparison of Eqs. (16. 20) 
and (16.35) shows that they are L' nit |mi(¢), Where 


mi = da — (|m| + 1) (16.36) 


is a positive integer or zero. 
In similar fashion, the solution of the 7 equation shows that the 
number 


m2 = de — 3(|m| + 1) (16.37) 
is a positive integer or zero. From Eqs. (16.36) and (16.37) we obtain 
Ar +A. =m +n2+ |m +l=n . (16.38) 


where n is a nonzero positive integer. The energy levels are given by 
combining Eqs. (16.32), (16.33), and (16.38): 


- = h7a 2 ee uZ 24 
Ba = Wea a oper 
in agreement with Eq. (16.15). The energy level E,, is degenerate, since 
according to (16.38) there are various ways in which the three quantum 
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numbers 71, 2, and m can be combined to make up n. For m = 0, there 
are n ways of choosing n; and m2; for |m| > 0, there are two ways of 
choosing m (= +|m|), and n — |m| ways of choosing n: and m2. Thus 
the total degeneracy of the energy level £, is 


n—-1 


n+2) (2 — Im) =n +2 [nm — 1-20 =) | = 


|m|=1 


in agreement with the earlier result. 


“WAVE FUNCTIONS 


It is clear from the foregoing discussion that the unnormalized hydrogen- 
atom wave functions in parabolic coordinates are 


Unnym(E,7,) re e-aCet (Eq)almlpiol Cat) iiml (ane 
seek nett Oe ote att 
h?(n1 + ne + |m| + 1) 


For a particular energy level EH, and magnetic quantum number 
m (n > |m|), the parabolic quantum numbers n; and nz can be chosen 
such that m1 + nz = n — |m| — 1, that is, in n — |m| different ways. 
Similarly, for given n and m, the azimuthal quantum number / in the 
spherical solution can be chosen such that |m| <1 <n — 1, and so also 
in n — |m| different ways. Thus the n — |m| products of the & and 7 
functions are linear combinations of the n — |m| products of the r and 
6 functions. . 

The ground-state energy level provides a particularly simple illus- 
tration of the connection between the parabolic and spherical solutions. 
In this case, ny = nz = m = 0, and the parabolic solution is simply 
eHZeXEtn/20? Also, n = 1, 1 = m= 0, and the spherical solution is 
e-#Zer/ = Tt is apparent from Eq. (16.25) that these two solutions are 
identical. 


Qa 


PROBLEMS 


1. Apply the Bohr-Sommerfeld quantization rules (see Sec. 2) to the determination 
of the energy levels of a harmonic oscillator and of the circular orbits in a hydrogen 
atom. Compare with the results obtained in this chapter. 

2. What is the order of magnitude of the spread of quantum numbers and energies 
of the states that contribute significantly to the oscillating-wave-packet solution for 
the harmonic oscillator? 

3. Use the generating function for the Hermite polynomials to evaluate 


| oa un (x) 2%tm(x) dx 


where the u’s are normalized harmonic-oscillator wave functions. 
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_4. Use the generating function for the Legendre polynomials to evaluate 
i 
i Pi(w)Pv(w) dw 


5. Obtain an approximate analytic expression for the energy level in a square well 
potential (1 = 0) when Va? is slightly greater than 1?h?/8m. 

6. Show that for a square well potential the limiting values of Voa? that just bind 
new energy levels with an / value greater than zero are given by h?z?/2m, where the 
numbers z are the nonvanishing solutions of the equation j:-1(z) = 0 (see footnote 1, 
page 88). 

7. Assume that the interaction between the neutron and the proton that make up 
a deuteron can be represented by a square well potential with a = 2.00 X 107!* cm. 
If the lowest (J = 0) energy level of the system is — 2.23 Mev (million electron volts), 
calculate V» in Mev to three significant figures. How does the answer compare with 
that which would be obtained from the approximate formula derived in Prob. 5? 

8. Consider Eq. (14.17) with | = 0 and V(r) = —Voe7/*. Change variables from 
r to z = e7’¢, and show that Bessel’s equation results. What boundary condi- 
tions are to be imposed on x as a function of z, and how can these be used to determine 
the energy levels? What is the lower limit to V> for which a bound state exists? 

9. Find expressions for the eigenfunctions and energy levels of a particle in a two- 
dimensional circular box that has perfectly rigid walls. 

10. It is shown in Sec. 9 that a one-dimensional square well potential has a bound 
state for any positive Voa?, and in Sec. 15.that a three-dimensional spherically sym- 
metric square well potential has a bound state only for Voa? > 2?h?/8m. What is 
the analogous situation for a two-dimensional circularly symmetric square well poten- 
tial? What, if any, is the physical significance of these results? 

11. The Schrédinger equation for a rigid body that is constrained to rotate about a 
fixed axis and that has a moment of inertia J about this axis is 


ay _ ay 


where y¥/(¢,t) is a function of the time ¢ and of the angle of rotation, ¢, about the axis. 
What boundary conditions must be applied to the solutions of this equation? Find 
the normalized energy eigenfunctions and eigenvalues. Is there any degeneracy? 
12. Find the energy levels of a three-dimensional isotropic harmonic oscillator 
[V(r) = 4Kr*] by solving the wave equation in cartesian coordinates. What is the 
degeneracy of each level? Show that this equation can also be separated in spherical 
and in cylindrical coordinates. 
13. Show that the expectation value of the potential energy of an electron in the 
nth quantum state of a hydrogen atom is —Z%e?/aon?. From this result, find the 
expectation value of the kinetic energy. 
ay Find the normalized hydrogen-atom wave functions in parabolic coordinates for 
= 2, m=0. Express them as linear combinations of the corresponding wave 
Feacisnt in spherical coordinates. 
15. Discuss the parities, if any, of the bycrpen-atore wave junctions’ in parabolic 
coordinates. 


5 
Continuous Eigenvalues: 
Collision Theory 


Problems for which the energy eigenvalues are continuously distributed 
usually arise in connection with the collision of a particle with a force 
field. The method of approach is different from that employed in the 
preceding chapter. There the boundary conditions at great distances 
were used to determine the discrete energy levels of the particle. In a 
collision problem, the energy is specified in advance, and the behavior of 
the wave function at great distances is found in terms of it. This 
asymptotic behavior can then be related to the amount of scattering of 
the particle by the force field. 

As in Chap. 4, the relatively few exact solutions that are obtained 
here are of wider application than might at first seem to be the case, 
since they can serve as foundations for approximate calculations on more 
complicated systems. It is interesting to note that the study of collisions 
is particularly important in connection with atomic nuclei and elementary 
particles, where relatively little information can be obtained in other 
ways. 
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Fig. 14. A one-dimensional square potential 
barrier of height V> and thickness a. 


17.0NE-DIMENSIONAL SQUARE POTENTIAL BARRIER 


We consider first the one-dimensional collision of a particle with the 
square potential barrier V(x) shown in Fig. 14. In this problem we are 
interested in a particle that approaches from the region of negative x 
and is reflected or transmitted by the barrier. In the corresponding 
classical problem, the particle is always reflected if its energy is less than 
that of the top of the barrier, and it is always transmitted if its energy is 
greater. We shall see that,.in the quantum problem, both reflection and 
transmission occur with finite probability for most energies of the particle. 
Because of the lack of symmetry between positive and negative x that 
is introduced from the beginning, it is disadvantageous to deal with 
solutions that have definite parities, and so there is no reason for making 
V(z) symmetrical about z+ = 0, as was done in Sec. 9. We assume, 
therefore, that V(x) = 0 for x <0 and «>a, and V(z) = Vo for 
0 < x < a, where Vo is positive. 


ASYMPTOTIC BEHAVIOR 


We are interested in representing a particle that approaches from the 
left with energy E > 0 and may be turned back by the potential barrier 
or penetrate through it. Thus the asymptotic behavior [in the regions 
where V(x) = 0] is as follows: For x < 0, we want the wave function to 
represent a particle moving to the left (reflected particle) as well as to © 
the right (incident particle); for s > a, we want the wave function to 
represent only a particle moving to the right (transmitted particle). 

A particle in a force-free region that is moving in a definite direction 
with a definite energy necessarily has a definite momentum and hence 
can be represented by a one-dimensional momentum eigenfunction 
u(x) « e'?2/* if the particle is moving in the positive x direction with the 
momentum p, and u(x) « e~‘?*/4 if the particle is moving in the negative 
zx direction with the same energy. Thus, since the wave equation in the 
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regions where V(x) = 0 is 


h? du 
— Om dat ~ Bu 
our asymptotic solutions are 
u(x) = Ae* + Bet z<0 
(2) (17.1) 
u(x) = Cet r>a 


where k = p/h = +(2mE/h?)' is the magnitude of the propagation 
number. The solutions (17.1) are appropriate for the force-free regions 
that are external to any scattering potential, whether or not it has the 
simple form shown in Fig. 14. 


NORMALIZATION 


The physical meaning of the coefficients A, B, and C can be inferred by 
substituting (17.1) into the one-dimensional form of the probability 
current density given by Eq. (7.3). 


S(z) = v(|Al|? — |BI?) a<o0 
S(xz) = v|C|? z>a 


where v = ik/m is the speed of a particle with propagation number k. 
Since these expressions are independent of x, the discussion of Sec. 7 
shows that they may be interpreted as the net flux (positive to the right) 
in the two regions. This interpretation is consistent with the statement 
above that A, B, and C are the amplitudes of the incident, reflected, 
and transmitted wave functions, respectively. 

The absolute normalization of the wave functions (17.1) is unim- 
portant for this problem; this is because we are interested only in the 
ratios of |B|? and |C|? to |A|?, which are, respectively, the reflection and 
transmission coefficients for the barrier. It is sometimes convenient, 
however, to normalize the incident wave function to unit flux; this cor- 
responds to taking A = 1/v'. Such a normalization must not be inter- 
preted as indicating that u(x) represents more than one particle; rather 
it means that we choose a large enough number of systems [each described 
by u(z)] that are identical and independent in the sense of Sec. 7, so that 
the total incident flux in all of them is unity. A more precise but some- 
times less convenient normalization would assume a one-dimensional 
“box” of length L with periodic boundary conditions and require that 


Te ju(x)|? dx = 1. 


(17.2) 


SCATTERING COEFFICIENTS 


The character of the solution inside the potential barrier depends on 
whether FE is greater or less than Vo. Suppose first that E > Vo, 
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so that we can define a propagation number inside the barrier: 
a = [2m(E — V>)/h*}*. Then the solution inside is 


u(x) = Fe + Ge-** O0O<zx<a (17.3) 


The continuity of u and du/dz at z = 0 and x = a required by the 

boundary conditions provides four relations between the five coefficients. 

We can eliminate F and G and solve for the ratios B/A and C/A. 
B (k? — a?)(1 os e?tas) 

A (k+ a)? — (k — a)*e** 

CL 4kae*a—*)e 

A (kh +a)? — (kh — a)*e%a0 


The absolute squares of the ratios (17.4) are the scattering (reflection 
‘and transmission) coefficients 


(17.4) 


By _ a 4E(E — Vo) | 

Al E - (k? — a)? sin? =| — E + “Vsin? aa | (7.5) 
\S sem fe i (k? — a)? sin? aa |? _ [| re V2 sin? aa |“ 
A Ak?a?* 7 4E(E — Vo) 


It is readily verified from (17.5) that |B/A|? + |C/A|? = 1, as would 
be expected. 
Equations (17.5) show that the transmission coefficient approaches 


\—1 ; 
(1 + me) (17.6) 


when the particle energy approaches the energy of the top of the barrier 
(E— V.). For increasing E (E > Vo), the transmission coefficient 
oscillates between a steadily increasing lower envelope and unity (see 
Fig. 15). There’is perfect transmission when aa = 7, 2r, ... , that is, 
whenever the barrier contains an integral number of half wavelengths.’ 
Interference phenomena of this type are well known in the transmission 
of light through thin refracting layers. 

The reflection and transmission coefficients for 0 < E < Vo are 
most easily obtained by replacing a by 78 in Eqs. (17.4), where 


ie EY ee ay 


h2 
The result for the transmission coefficient is 
Cre. Vo? sinh? Ba |“! 
ef = E +e | (17.7) 


1 This effect also occurs when Vo < 0, in which case the square barrier becomes a 
square well. The scattering coefficients are given by (17.5) if the sign of Vois changed 
there and in the expression for a. 


104 » 3 ' QUANTUM MECHANICS 


Fig. 15 ‘The transmission coefficient of a square barrier as a function of particle 
energy for mV oa?/h? = 8. 


which decreases monotonically from the value (17.6) as E decreases 
below Vo. This behavior is often referred to as tunneling through the 
barrier. When fa >> 1, the transmission coefficient (17.7) becomes very 
small and is given approximately by : 


16E(Vo — E - 

Oy ) @-Ma 7 (17.8) 
Figure 15 is a plot of the transmission coefficient computed from Eqs. 
(17.5) and (17.7) for a rather “opaque” barrier: mV qa2/f2 = 8. 


SCATTERING OF A WAVE PACKET 


The foregoing treatment of scattering makes use of stationary solutions 
of the time-independent wave equation (8.2) that correspond to par- 
ticular values of the incident momentum. It is also possible to work 
with the time-dependent equation (6.16), although this is more difficult 
since even in the one-dimensional case we must deal with a partial 
differential equation in two independent variables, x and t. There is, 
however, the advantage that we can then study the motion of a wave 
packet of the general form discussed in Sec. 12 and see in what way it is 
reflected from and transmitted by the potential. 
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There are two ways of approaching this problem. The solution 
can be expanded in energy eigenfunctions, as in Eq. (12.15), and the 
summation carried out explicitly as in the transition from Eqs. (12.18) 
and (12.19) to (12.20). Although this could be done analytically in 
Sec. 12 for a free particle, it is generally impossible when a potential is 
present. The other approach consists in performing a numerical inte- 
gration of Eq. (6.16), choosing small but finite time intervals, and 
approximating the differential equation in the time by a difference 
equation. Even in the one-dimensional case this is feasible only 
with a high-speed computer, and special computational techniques must 
be invented to make the problem manageable. Some typical graphs 
are reproduced here from a very large number that form the basis of com- 
puter-generated motion pictures.? 

In these computations, units are chosen such that 4 = land m = 3. 
The unit. of length can still be chosen arbitrarily and is slightly longer 
than the total length of the abscissa shown in each graph. Then the time 
and the reciprocal of the energy are each expressed in terms of a unit that 
is the square of the length unit. The incident wave packet is of the 
gaussian form (12.11), with Ar = 0.035 and (p) = 502, 70.77, and 1007. 
The potential has breadth a = 0.064 and strength Vo = +(70.71)?, so 
that the mean energy of the incident wave packet is 3|Vo, |Vol, and 2| Vo. 
Succeeding frames of the motion picture are shown in Fig. 16 when the — 
mean energy of the wave packet is half the barrier height, and in Fig. 17 
when the mean energy is equal to Vo. Figures 18 and 19 show the 
attractive case in which the mean energy is half of the well depth and 
equal to the well depth, respectively. In all these cases, |p|? is plotted 
as ordinate and x as abscissa, and the potential width and position are 
indicated; the number in each frame denotes the time in arbitrary units. 
Figure 17 is especially interesting, since it shows how the probability 
decays slowly and bounces back and forth within the barrier when. the 
mean kinetic energy of the particle in the barrier is zero. 


18LICOLLISIONS IN THREE DIMENSIONS 


We are primarily concerned in this chapter with collisions in three 
dimensions, in which a particle collides with a fixed force field or two 
particles collide with each other. It was shown in Sec. 16 that the prob- 
lem of the nonrelativistic motion of two particles, when the only forces 
present depend on their relative positions, can be broken up into two one- 
particle problems, of which one describes the motion of the particles 
relative to each other or to their center of mass, and the other describes 
the free motion of the center of mass. Although the center of mass canbe 


1A. Goldberg, H. M. Schey, and J. L. Schwartz, Am. J. Phys. 35, 177 (1967). 
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Fig.16 Gaussian wave-packet scattering from a square barrier when the mean energy 
of the packet is half the barrier height. 


taken to be at rest in calculating the energy levels of the internal motion, 
as in Sec. 16, it has a definite motion in a collision that cannot be ignored 
in calculating the outcome of such an experiment. This is because the 
customary laboratory procedure consists in bombarding a particle that 
is initially at rest with another particle that carries the total energy 
E, = E + E’ of Eq. (16.5). Thus the energy E£ of the relative motion of 


ae se 


Fig. 17 Same as Fig. 16, with the mean energy equal to the barrier height. 


the two particles is different from the bombarding energy Eo, and the 
observed scattering depends on whether the struck particle or the center 
of mass is initially at rest. 

We call the coordinate system in which the bombarded particle is 
initially at rest the laboratory coordinate system and the coordinate system 
in which the center of mass of the two colliding particles is (initially 
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Fig. 18 Gaussian wave-packet scattering from a square well when the mean energy . 
of the packet is half the well depth. 


and always) at rest the center-of-mass coordinate system. It is evidently 
easier to calculate the result of a collision experiment in the center-of- 
mass system than in the laboratory system, since only 3 degrees of free- 
dom appear in the former as compared with 6 in the latter system. The 
collision process in the center-of-mass system may then be thought of as 
one in which a particle that has the reduced mass p = mym2/(m + mez) 
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Fig. 19 Same as Fig. 18, with the mean energy equal to the well depth. 


of Eq. (16.4) and an initial velocity v collides with a fixed scattering 
center [see the discussion of Eq. (18.9) below]. The distribution in angle 
of the scattered particles will be affected by the transformation between 
the center-of-mass coordinate system, in which the calculations are made, 
and the laboratory coordinate system, in which the observations are 
made. 
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SCATTERING CROSS SECTION 


The angular distribution of particles scattered by a fixed center of force 
or by other particles is conveniently described in terms of a scattering 
cross section. Suppose that we bombard a group of n particles or scatter- 
ing centers with a parallel flux of N particles per unit area per unit time 
and count the number of incident particles that emerge per unit time in 
a small solid angle Aw) centered about a direction that has polar angles 
4) and ¢o with respect to the bombarding direction as polar axis. This 
number will be proportional to N, n, and Aw, provided that the flux is 
small enough so that there is no interference between bombarding parti- 
cles and no appreciable diminution of the bombarded particles by 
their recoil out of the target region, and provided also that the bombarded 
particles are far enough apart so that each collision process involves only 
one of them. 

Then the number of incident particles that emerge per unit time in 
Aw, can be written 


nNo (60,0) Aw (18.1) 


where the proportionality factor oo(@0,¢0) is called the differential scatter- 
ing cross section. Since (18.1) has the dimensions of reciprocal time, 
70(90,¢0) has the dimensions of an area. (80,60) Awo is equal to the cross- 
sectional area of the parallel incident beam that contains the number of 
particles scattered into Aw» by a single target particle or scattering center. 
The integral of oo(6@0,¢0) over the sphere is called the total scattering cross 
section 


To = Joo(4o,¢0) dwo (18.2) 


For the collision of a particle with a fixed scattering center, the 
definition (18.1) of differential scattering cross section is equally valid in 
the laboratory and center-of-mass coordinate systems, since a scattering 
center that is fixed has an infinite effective mass and so the center of 
mass of the system does not move. For a collision between two particles 
of finite mass, however, the differential cross section (18.1) applies in 
general only to the laboratory coordinate system and to the observation 
of the scattered incident particle. It does not describe the observation of 
the recoil bombarded particle in the laboratory system, although it is 
of course possible to obtain a differential cross section for the recoil 
particle from oo(@0,¢0). In the center-of-mass system the differential 
cross section o(6,¢) may be defined in analogy with (18.1), where again 
the scattered incident particle is the one that is observed and the flux N 
of the incident particle is computed with respect to the bombarded 
particle, not the center of mass. Since in this coordinate system the two 
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particles move in opposite directions away from each other after the 
collision, it is clear that the differential cross section for observation of 
the recoil bombarded particle in the direction 6, ¢ is just ¢o(r — 0, @ + =). 


RELATIONS BETWEEN ANGLES IN THE LABORATORY AND CENTER-OF-MASS SYSTEMS 


The relation between the differential cross sections and angles in the 
laboratory system and in the center-of-mass system can be found by 
translating the laboratory system in the direction of the incident particle 
with sufficient speed to bring the center of mass to rest. Figure 20a 
shows a particle of mass m, and initial speed v striking a particle of 
mass mz, that is initially at rest; the center of mass moves to the right with 
the speed v’ = my/(m; + m2), as may be seen from the conservation of 
momentum. Thus in the center-of-mass system the particles of masses 
m, and m2 approach the center of mass with speeds 


Mov 
m, + Mme 


and v’, respectively. If the collision is elastic, they evidently recede from 
the center of mass after the collision with the same speeds (see Fig. 20b). 
It follows from the geometry of the situation that 6 and ¢ are related to 
69 and ¢o by 


Za 


wv’ =y—y = 


v’’ cos 6 + v’ = v1 cos 


v’’ sin 0 = 0; sin 0 (18.3) 
= do 
From the first two of Eqs. (18.3) we obtain, on elimination of 1, 
_ sing _v mm 


Equations (18.3) and (18.4) can be valid for inelastic collisions 
as well. Consider a general binary collision (for example, a nuclear 
reaction) in which a particle of mass m, strikes a particle of mass m, 
initially at rest and, after the collision, particles of masses m3 and m, 
emerge, where m, + m, = m3; + m,. If also an amount of energy, Q, 
is converted from internal energy to kinetic energy of the emergent 
particles (Q is positive for exothermic and negative for endothermic 
collisions) and the particle of mass m; is observed, the first of Eqs. (18.4) 
is still valid. In this case y is still equal to the ratio of the speed of the 
center of mass in the laboratory system to the speed of the observed 
particle in the center-of-mass system. However, 7 is no longer mi/m:2 
but can be shown to be given by 


m4mM3 E ) 


tie (Fe E+Q Ce) 
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Fig. 20 (a) Laboratory coordinate system, in which the target particle 
of mass mz is initially at rest. (6) Center-of-mass coordinate system, 
in which the center of mass is initially and always at rest. (c) Vector 
addition of velocity of center of mass in laboratory system (v’) to velocity 
of observed particle in center-of-mass system (v’’) to give velocity observed 
in laboratory system (vi); if v’’ < v’, 0 cannot exceed the angle sin7! 


(v"’ /v'). 
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where E = mymv?/2(m, + mz) is the energy initially associated with the 
relative motion in the center-of-mass system [see the discussion of Eq. 
(18.9) below]. 


RELATION BETWEEN CROSS SECTIONS 


The relation between the cross sections in the laboratory and center-of- 
mass coordinate systems can be obtained from their definitions, which 
imply that the same number of particles are scattered into the differential 
solid angle dw about 60, ¢o as are scattered into dw about 0, ¢. 


70(90, $0) sin 0 dA ddo = o(0,¢) sin 6 dé do (18.6) 


With the help of the last of Eqs. (18.3) and the first of Eqs. (18.4), Eq. 
(18.6) gives 


(1 + y? + 27 cos 6)# 
|1 + y cos 6] 


where in general ¥ is given by Eq. (18.5). It should be noted that the 
total cross section is the same for both laboratory and center-of-mass 
systems and also for both the outgoing particles, since the total number 
of collisions that take place is independent of the mode of description of 
the process. 


g0(80,0) = o(,¢) (18.7) 


DEPENDENCE ON 7Y 


For y < 1, Eq. (18.4) shows that 69 increases monotonically from 0 to 
m as 6 increases from Oto. For y = 1, 6) = 46 and varies from 0 to 4m 
as @ varies from 0 to 7; in this case 


F0( 90,0) = 4cos O00 (280, do) 


and no particles appear in the backward hemisphere in the laboratory sys- 
tem. For y > 1, 6 first increases from 0 to a maximum value sin! (1/7), 
which is less than $7, as 6 increases from 0 to cos~! (—1/y); 60 then 
decreases to 0 as @ increases further to x. In this case o(8,¢0) is usually 
infinite at the maximum value of 60, although this singularity gives a 
finite contribution to the total cross section; no particles appear beyond 
the maximum 6) in the laboratory system. The two values of @ that 
give rise to a particular value of 6) between 0 and sin— (1/7) can be 
distinguished by the energy of the observed particle, which is greater 
for the smaller 6. 

This last case (y > 1) is illustrated schematically in Fig. 20c. The 
resultant of the velocity v’’ of the observed particle in the center-of-mass 
system and the velocity v’ of the center of mass in the laboratory system 
gives the velocity v; of the observed particle in the laboratory system. 
The locus of the terminal points of v; when its origin is at the point O 
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is the circle of radius v’’. Thus, whenv” < v’, the angle 6 of the resultant 
vi with the bombarding direction cannot exceed the angle 


ld 
; v ; 1 
sin~! —- = sin7!- 
v Y 


As the ratio y = v'/v’’ decreases, the circle gets relatively larger and the 
angular range of v, increases. 

The use of geometrical relationships in the foregoing discussion is 
valid in a quantum-mechanical system as well as in a classical system. 
This is because they are essentially relations between momentum vectors 
that are applied in the asymptotic region where the particles need not be 
precisely localized in space and hence can have definite momenta. 

It is interesting to note that the difference between laboratory and 
center-of-mass systems is negligible in the collisions of electrons with 
atoms, because of the large mass ratio of the colliding particles. In 
nuclear collisions, however, the difference between the two coordinate 
systems is usually significant. 


ASYMPTOTIC BEHAVIOR 


The differential scattering cross section o(6,¢) in the center-of-mass 
coordinate system can be found from the asymptotic form of the solution 
of the second of Eqs. (16.5), 


2 
— Ls Vu+ Vu = Eu (18.8) 
ib 


which is the wave equation for the relative motion. ‘The wave function 
u may be written as a function of the angles 0, ¢ of Fig. 20b and the radial 
distance r between the two particles. From Eq. (16.4) the reduced mass 
is p = mym2/(m, + m2). The energy E associated with the relative 
motion is easily seen from Fig. 20 to be 
me 

where Eo is the initial energy of the bombarding particle. It is interesting 
to note that E is equal to the kinetic energy of a particle whose mass is 
the reduced mass p and whose speed is the relative speed v. Thus we can 
think of Eq. (18.8) as representing the elastic collision of a particle of mass 
uz, initial speed v, and kinetic energy E = guv”, with a fixed scattering 
center that is described by the potential energy V(r); then r is the vector 
distance from the fictitious particle » to the origin of the scattering — 
potential. 

As in See. 17, the scattering is determined by the asymptotic form of 
u(r,0,¢) in the region where V = 0. When the colliding particles are 


CONTINUOUS EIGENVALUES: COLLISION THEORY 115 


far apart, we want wu to contain a part that represents an incident particle 
of mass » moving in a particular direction with speed v, and a part that 
represents a radially outgoing particle: 


u(7,0,46) —» Ale + 1-¥f(6,4)e] == (18.10) 
T—> 0 

The first term in Eq. (18.10) represents a particle moving in the positive 
z direction, or along the polar axis @ = 0, since z = r cos 6; it is an infinite 
plane wave of the form of the momentum eigenfunction (11.2), where the 
propagation vector k has the magnitude & and is directed along the polar 
axis. The second term in Eq. (18.10) represents a particle that is moving 
radially outward; its amplitude depends on @ and ¢ and is inversely 
proportional to r, since the radial flux must fall off as the inverse square 
of the distance. It is readily verified that Eq. (18.10) satisfies the wave 
equation (18.8) asymptotically through terms of order 1/r in the region 
in Which V = 0, for any form of the function f(6,¢). 


NORMALIZATION 


The physical meaning of the coefficient A dnd the angular function f 
can be inferred from a calculation of the particle flux, as in Sec. 17. 
A straightforward substitution of Eq. (18.10) into Eq. (7.3), however, 
yields interference terms between the incident and scattered waves that — 
do not appear in most experimental arrangements; that they do not 
appear can be seen in the following way. 

In practice, the incident and scattered particles are separated from 
each other by collimating one or the other. Suppose, for example, that 
the experimental arrangement is as shown schematically in Fig. 21, so 
that the bombarding particles from the source S are collimated by dia- 


D D 


a@ 


Fig. 21 Schematic diagram of a laboratory arrangement for the measurement of 
scattering, in which there is no interference between the incident and scattered 
waves at the point of observation, P. 
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phragms DD into a fairly well-defined beam. Such a collimated beam 
is not an infinite plane wave of the form e** but can be made up by 
superposing infinite plane waves that have propagation vectors of 
slightly different magnitudes and directions. The total angular spread 
in radians will be of the order of the ratio of the wavelength of the particle 
to the diameter of the collimating aperture and so can be made extremely 
small in a practical case. Now f usually does not vary rapidly with angle, 
so that the small directional spread of the incident propagation vectors 
does not affect f significantly. Thus at the point of observation, P, only 
the f term is present, and it is essentially the same as that which appears 
in Eq. (18.10). The incident flux can be calculated from the plane-wave 
term alone, since if we go far enough from the scattering region, the f 
term can be made negligible. Thus in the region of observation, the 
interference terms are a consequence of the idealization implicit in assum- 
ing an infinite plane wave in Eq. (18.10), and they usually have no 
physical significance.! 

Substitution of the two terms of Eq. (18.10) separately into Eq. (7.3) 
shows that the incident flux is of magnitude v|A|? along the polar axis 
and that the leading term in the scattered flux is of magnitude 


rf A]71F(0,9)? 


r2 


along the outward radius. From the definition of cross section, it follows 
that 


o(6,¢) = |f(6,¢)|? (18.11) 


As discussed in Sec. 17, the choice of the coefficient A is unimportant 
so far as the calculation of the scattering is concerned. The wave func- 
tion may be normalized to unit incident flux by choosing A = 1 /v}, or it 
may be normalized by making f|u|? d’r = 1 over a large box that has 
periodic boundary conditions. We shall often simply set A equal to 
unity. 


19_ISCATTERING BY SPHERICALLY SYMMETRIC POTENTIALS 


The asymptotic behavior of the wave function determines the differential 
scattering cross section but cannot itself be found without solving the 
wave equation (18.8) throughout all space. As was the case with 
the calculation of energy levels considered in Chap. 4, this can be done 
only when the wave equation is separable, and a particular case of great 
physical interest is that in which the potential energy is spherically 
symmetric. We assume here that V is a function only of 7, and we find 


1 For a somewhat exceptional case, see the discussion at the end of Sec. 20. 
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the connection between the solutions separated in spherical polar coordi- 
nates and the asymptotic form (18.10); this procedure is called the 
method of partial waves. 

In the remainder of this chapter we shall not, for the most part, 
distinguish between collisions of a particle with a fixed scattering center 
and collisions between two particles treated in the center-of-mass coordi- 
nate system. 

Elastic scattering, for which V is real, is considered in this section. 
The situation in which V is complex is discussed in Sec. 20 and is of 
interest since certain inelastic processes can be described approximately 
in this way. 


ASYMPTOTIC BEHAVIOR 


It is apparent that the problem now possesses symmetry about the polar 
axis, so that u, f, and o are independent of the angle ¢. The general 
solution of Eq. (18.8) can then be written as a sum of products of radial 
functions and Legendre polynomials (see Sec. 14); for later convenience 
we write this in the form 


u(r,6) = ») (21 + 1)2'Ri(r)Pi(cos @) 
1=0 


(21 + 1)#r—'x:(r) Pifcos @) (19.1) 


ms 


i 


0 


where P, is the Legendre polynomial of order 1, and x; satisfies the 
equation 

2. 

dx, J — um - 22) = 0 


dr? 


2uE\+ 2uV 
e = Fir) u(y) = 2 __,o 


The boundary condition at r = 0 that R, be finite or x; vanish deter- 
mines the asymptotic form of the solution (19.2) except for an arbitrary 
multiplicative constant. 

In order to find the general nature of this asymptotic behavior, we 
consider r to be so large that the U and / terms in Eq. (19.2) can be 
neglected. Then the solution of Eq. (19.2) is one of the forms e+**". To 
get a better approximation, we put 


xi(r) = A exp Lf f(r’) ar’ | extkr (19.3) 


where A and a are constants. The first exponential is assumed to be a 
slowly varying function of r for large r, which implies that f(r) falls off 


(19.2) 
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more rapidly than 1! as r— ©. Substitution of (19.3) into (19.2) 
gives the following equation for f: 


fps ong =u) + C2 2 we (19.4) 


where the prime denotes differentiation with respect to r. If now W(r) 
falls off like r-* for large r (s > 0), the last term on the left side is the 
leading term, and f also falls off like r-*. In this case, x: varies like e+ 
for large r if s > 1, since then the integral in the exponent of Eq. (19.3) 
converges for large r._ If, on the other hand, W falls off like an exponen- 
tial or error function of r (which implies that / = 0), the first and third 
terms on the left side of Eq. (19.4) may both have to be considered. It 
can then be shown without difficulty that x, again varies like e+*r for 
large r. The coulomb field, for which U and W vary like r— for large r 
regardless of the value of J, is the only case of physical interest that 
requires special attention and will be discussed in Sec. 21. 

The asymptotic form of x:(r) can then be written quite generally 


xi(r) Ep A; sin (kr + 8;) | (19.5) 


where thus far A; and 6, can be complex. The solution of (19.2) that 
vanishes at r = 0 is unique except for a multiplying constant. It can 
be shown that this solution is real everywhere if it starts out to be real at 
r = 0, since k, U, and / are all real. Thus 5; must be real, although A} 
need not be. This being the case, it is readily verified that the total 
radial flux of particles through a large sphere vanishes: 


lim 2nr? f" 8, sin 6 d6 = 0 | (19.6) 
where S, is the radial component of the vector (7.3) calculated by sub- 
stituting u(r,6) from Eq. (19.1) into it. This means that no sources or 
sinks of particles are present, and the particles that are scattered radially 
outward are supplied by the incident plane wave. 


DIFFERENTIAL CROSS SECTION 


It is convenient to redefine the amplitude A}; and phase angle 6, that 
appear in Eq. (19.5) in terms of a somewhat more specialized problem. 
It will be assumed that U(r) can be neglected for r greater than some 
distance a; in cases of practical interest, a may be small enough so that 
the / term in (19.2) is not negligible. We can then divide the domain of 
r into three regions. In the first, U(r) cannot be neglected, and the form 
of Ri(r) depends on U. In the second, U(r) can be neglected, but the / 
term in (19.2) is not negligible. Then the most general form for R,(r) 
that is real (except possibly for a complex multiplying constant) is shown 
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in Sec. 15 to be 
Ri(r) = Ajfeos 67:(kr) — sin d(kr)] (19.7) 


where 6; is real. In the third region, the / term as well as U is negligible, 
and &,(r) is obtained from the asymptotic formulas (15.8): 


Ri(r) —> (kr)-1Ar sin (kr — le + 51) (19.8) 


Equations (19.5) and (19.8) agree if A, = kA; and 8; = 6, + dlr. 

We now wish to identify the asymptotic form of (19.1) with (18.10). 
To do this, we require an expansion of e** = ercos® in Legendre poly- 
nomials:! ; 


eikreos® — ») (21 + 1)2j:(kr) Pr(cos 6) (19.9) 
1=0 
Substituting the asymptotic form of (19.9) into (18.10) with A = 1, and 
equating this to the asymptotic form of (19.1), we obtain — 


Dy (21 + 1)i(kr)-! sin (kr — $lr)Pi(cos 0) + r—'f(@)e* 
1=0 


= Y (21 + 1)@#Aj(kr)-! sin (kr — $l + 6:)Pi(cos 8) 
=0 

When the sine functions are written in complex exponential form, the 

coefficients of e* and of e-**" on the two sides of this equation must be 


equal to each other: 


2ilf(0) + Yi (21+ L)te-**P,(cos 6) 
i=Q 


(21 + 1)7Aje**r-*™ Pi(cos @) (19.10) 


Mis tM 


y (21 + 1)z'e#* P, (cos 6) (21 + 1)7Aie**r-#'*) P, (cos 6) 
1=0 


l=0 


Since these are true for all values of 6 and the Legendre polynomials are 
orthogonal to each other, the second of Eqs. (19.10) becomes 


A, = e% 


Substitution of this into the first of Eqs. (19.10) gives for the scattering 
amplitude 


f(0) = (21k)! Y (QL + 1)(e%* — 1) Pi(cos 6) (19.11) 
l=0 


1G. N. Watson, ‘Theory of Bessel Functions,” rev. ed., p. 128 (Macmillan, New 
York, 1944). 
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Thus the differential cross section is 


(0) = |f(I? = 75 


) (21 + 1)e* sin 6; Pi(cos @) : (19.12) 
1=0 


It is useful to note that Eqs. (18.10), (19.11), and (19.12), which 
refer to the asymptotic behavior of the wave function, are also valid 
when special relativity is taken into account. So long as we are dealing 
with motion in the center-of-mass coordinate system, which can be 
described by a function u of the relative coordinates, we can specify 
the scattering in this way in terms of the 6, where #k is the relative 
momentum. This is true even when the interaction cannot be described 
by a potential. 


TOTAL ELASTIC CROSS SECTION 


The total elastic cross section is the integral of Eq. (19.12) over the 
sphere. Because of the orthogonality of the Legendre polynomials, it 
contains no products of factors involving different values of I. 


ye Be f * a(o) ancode = a 2 (21 + 1) sin? 8, (19.13) 


The total cross section can also-be related to f(0). It follows from 
the generating function (14.10) for the Legendre polynomials that 
P,(1) = 1 for all J, so that Eq. (19.11) gives for 6 = 0 


f(0) = (21k) Y (21+ 1)(e%* — 1) 
1=0 
Comparison with Eq. (19.13) then shows that 
27 Ar 
TS es [f(0) — f*(0)] = 7 Im f(0) (19.14) 


where Im denotes the imaginary part of the expression that follows. 

The physical interpretation of Eq. (19.14) is as follows: In order for 
scattering to take. place, particles must be removed in an amount pro- 
portional to o from the incident beam, so that its intensity is smaller 
behind the scattering region (6 ~ 0) than in front of it. This can occur 
only by interference between the two terms in the asymptotic expression 
(18.10). Since such an interference term must be a linear function of the 
forward-secattered amplitude, we expect a relation of the general form of 
Eq. (19.14). An actual calculation of this interference term shows that 
Eq. (19.14) holds much more generally: when f depends on ¢ as well as on 
9, and when o includes inelastic scattering and absorption as well as 
elastic scattering. (See the end of Sec. 20.) 
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PHASE SHIFTS ; 


The angle 6; is called the phase shift of the Ith partial wave, since according 
to (19.8) it is the difference in phase between the asymptotic forms of the 
actual radial function R,(r) and the radial function j,(kr) in the absence 
of scattering potential (U = 0). The phase shifts completely determine 
the scattering, and the scattering cross section vanishes when each of the 
5, is 0° or 180°. 

It should be noted that the derivation of (19.11) is valid whether or 
not there exists the assumed radius a beyond which U(r) is negligible, 
provided that U(r) falls off more rapidly than 1/r. However, the method 
of partial waves is most useful for computing scattering cross sections if 
such a radius a does exist, especially if ka is of the order of or less than 
unity. The reason for this is that the first and largest maximum of 
ji(kr) lies roughly at r = 1/k; for r much smaller than this, 7: is small 
and increases about as 7 [see Eq. (15.7)]. Thus if a<1/k, j: will be 
very small where U is appreciable; then the /th partial wave will hardly 
be affected by the potential, the phase shift 6, will be very small, and 
the contribution to the scattering from that / will be negligible. It 
follows then that the scattering cross section consists of a series of terms 
extending from 1 = 0 to a maximum / that is of the order of ka. Since 
the computation of the phase shifts is usually a tedious affair, the smaller 
the magnitude of ka, the easier the method isto apply. Thus this method 
of partial waves is most useful at low bombarding energies. 

It is interesting to note that the classical distance of closest approach 
of a free particle of mass y», velocity v, and angular momentum /h to the 
origin is li/uv = 1/k. Thus the foregoing remarks are analogous to the 
statement that a classical particle is not scattered if it has sufficient 
angular momentum so that it does not enter the potential region r < a. 


CALCULATION OF 4: 


The phase shift 4; is computed by fitting the radial wave function F,(r) 
for r < a, which may have an analytic form and can always be found 
numerically if necessary, to the solution (19.7). The boundary condition 
at r = a is that (1/R,)(dR./dr) be continuous. Thus if y is the ratio of 
slope to value of the interior wave function, we have that 


kl ji(ka) cos 8 — nj(ka) sin &] _ 
ji(ka) cos 6; — mika) sin 6; a 


where the derivatives j, and n, may be rewritten with the help of (15.10). 
Then 6; is given by 


kji(ka) — yjr(ka) 


tan 61 = Teni(éa) — yimi(ka) 


(19.15) 
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Equation (19.15) can be used at once to obtain an approximate 
expression for 6, when / is large and 6; is expected to be small. In this 
case, 7, will differ little from the ratio of slope to value of the solution 
in the absence of a scattering potential, so that we put 


ji(ka) 


_ faGea) 
71 k Re + ej le2| K jilka) (19.16) 
Equation (19.15) can be rewritten with the help of (15.9) as 

fan 9) Sl eee) (19.17) 


€] (ka) a (ka) Ni (ka) —] 


which is still exact. If now we make use of the power series expansion 
for j, from (15.7) when J > (ka)?, and use (15.7) and (15.8) to estimate 
the order of magnitude of m, the inequality in (19.16) becomes 


lal <« a (19.18) 
and (19.17) may be approximated as 

Oaae €)(ka) +2 oe €2?"(1!)2(ka) 2+? | 

can (C01 a (Ch ee 


Equation (19.19) can be used to verify the convergence of the sums 
over partial waves such as appear in (19.11). We use Stirling’s formula 
_to find the leading terms in In |6,| when J is large, and we neglect terms of 
order In J and lower. 


In |8:| ~ In Je] + 2l[In (ka) + 1 — In 2] — 21 In1 


Thus even if {e| has the maximum value indicated by (19.18), 6; falls off 
like the inverse factorial of / (faster than exponentially), and the series 
that appear in the expressions for the scattering converge quite rapidly 
for large 1. 


RELATION BETWEEN SIGNS OF 6; AND V(r) 


It is apparent from (19.19) that, when / > (ka)?, 6: has the opposite sign 
from e. If now the potential-energy term V or U is positive, corre- 
sponding to forces that are mainly repulsive, Eq. (19.2) shows that the 
ratio of curvature to value for the radial wave function is more positive 
than in the force-free case. This means that the ratio of slope to value 
is more positive at r = a than is the case if U = 0. Thus a repulsive 
potential makes « positive and 6; negative. A negative phase shift means 
that the radial wave function is ‘‘pushed out” in comparison with the 
force-free wave function. 

In similar fashion, we see that a negative potential makes e negative 
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(a) () 


Fig. 22 Graphs of the effects of (a) positive (repulsive) potential and (b) negative 
(attractive) potential on the force-free radial wave function rjo(kr) (dotted curves). 
The solid curves are xo(r). (Graphs supplied by J. L. Schwartz and H. M. Schey.) 


and 6 positive. This means that the radial wave function is “pulled in” 
by the attractive potential. 

This may be seen graphically by comparing rjo(kr) and xo0(r) when 
they are arbitrarily made to start out with the same slope at r = 0. 
Figure 22a and b show the results of numerical computations performed 
for repulsive and attractive square potentials, respectively. The units 
chosen are such that A = 1, m = 4, and the length unit is one-fifth of the 
total length of the abscissa shown in the graphs. The potential has 
radius a = 1 and strength Vo = +2; the energy is EH = 4, so that ka = 2. 
In each case the dotted curve is rjo(kr) and the solid curve is xo(r). 


RAMSAUER-TOWNSEND EFFECT 


Figure 22b suggests that an attractive potential might be strong enough 
so that the 1 = 0 partial wave is pulled in by just half a cycle and its 
phase shift is 180°. If this were the case, the corresponding term in the 
expression (19.11) for f(@) would vanish, and there would be no contribu- 
tion to the scattering. This situation is shown in Fig. 23 which, like 
Fig. 22, was computed numerically. Here a = 2, Vo = —5, and E = 1, 
so that ka = 2. The dotted curve is rjo(kr), and the solid curve is xo(r); 
the potential width is also indicated. 

In such a situation, the possibility arises that ka can be small 
enough and the attractive potential strong enough so that all other phase 
shifts are negligibly small when 69 = 180°. The scattering amplitude 
f(@) then vanishes for all 6, and there is no scattering. This is the 
explanation! of the Ramsauer-Townsend effect, the extremely low mini- 


1 This explanation, suggested by N. Bohr, was shown to be quantitatively reasonable 
by H. Faxén and J. Holtsmark, Z. Physik 46, 307 (1927). 
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Fig. 23 Graphs of rjo(kr) (dotted curve) and xpo(r) 
(solid curve) when the attractive square well potential 
is strong enough to make 8) = 180°. The two wave 
functions start out with the same slope at r = 0, and 
xo(r) is pulled in by half a cycle. (Graphs supplied by 
J. L. Schwartz and H. M. Schey.) 


mum observed in the scattering cross section of electrons by rare-gas 
atoms at about 0.7-ev bombarding energy. A rare-gas atom, which 
consists entirely of closed shells, is relatively small, and the combined 
force of nucleus and atomic electrons exerted on an incident electron is 
strong and sharply defined as to range. It is clear that this minimum 
cross section will occur at a definite energy, since the shape of the wave 
function inside the potential is insensitive to the relatively small bom- 
barding energy whereas the phase of the force-free wave function depends 
rapidly on it. 

Physically, the Ramsauer-Townsend effect. may be thought of as a 
diffraction of the electron around the rare-gas atom, in which the wave 
function inside the atom is distorted in just such a way that it fits on 
smoothly to an undistorted wave function outside. This effect is anal- 
ogous to the perfect transmission found at particular energies in the one- 
dimensional problem considered earlier [see discussion of Eq. (17.5)]. 
Unlike the situation in one dimension, however, the Ramsauer-Townsend 
effect cannot occur with a repulsive potential, since ka would have to be 
at least of order unity to make 59 = —180°, and a potential of this large 
range would produce higher 1 phase shifts. 


SCATTERING BY A PERFECTLY RIGID SPHERE 


As a first example of the method of partial waves, we compute the scatter- 
ing by a perfectly rigid sphere, which is represented by the potential 
V(r) = + forr < a,and V(r) = Oforr>.a. The solution for r > a 
is just Eq. (19.7). The boundary condition, obtained in Sec. 8, that 
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u(a,6) = 0, is equivalent to the requirement that all the radial functions 
vanish at r =a. The phase shifts may then be obtained by setting 
either R,(a) given by (19.7) equal to zero, or y; in (19.15) equal to infinity: 


_ ju(ka) 
tan 6, = aka) (19.20) 
The calculation of the scattering is particularly simple in the low- 
energy limit: ka = 27a/X K 1. Then substitution of (15.7) into (19.20) 
gives as an approximation for the phase shifts 


= (ka)2+1 
(21 + 1)[(2d — 1)!!]? 


Thus 6, falls off very rapidly as / increases, in agreement with (19.19). 
All the phase shifts vanish as k — 0; however, the | = 0 partial wave 
gives a finite contribution to the scattering because of the factor 1/k? 
that appears in (19.12) and (19.13). We thus obtain 


tan 6; ~ (19.21) 


o(0) = a? o = 4ra? (19.22) 


The scattering is spherically symmetrical, and the total cross section is 
four times the classical value. 

In the high-energy limit (ka > 1), we might expect to get the classi- 
cal result, since it is then possible to make wave packets that are small in 
comparison with the size of the scattering region, and these can follow 
the classical trajectories without spreading appreciably. This corre- 
sponds to the ray limit in the wave theory of light or sound. The dif- 
ferential scattering cross section is rather difficult to find, and we only 
indicate the computation of the leading term in the total cross section. 
Substitution of (19.20) into (19.13) gives 


4n © (21 + 1)j2(ka) 

B 2, 5(a) + na) oie 
We can make use of asymptotic expansions of Bessel functions that are 
valid when the argument is large and the order is smaller than, of the 
order of, and larger than the argument.! The calculation shows that 
most of the contribution to the sum in (19.23) comes from 


l < (ka) — C(ka)* 


where C is a number of order unity; the leading term here is $(ka)?. 
The other two parts of the sum, for (ka) — C(ka)? < 1 < (ka) + C(ka)! 
and for 1 > (ka) + C(ka)', each contribute terms of order (ka)! and 


1 Watson, op. cit., chap. VIII. 
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hence may be neglected in the high-energy limit. Thus 
o = 2a? (19.24) 


which is twice the classical value.! 

The reason for the apparently anomalous result (19.24) is that the 
asymptotic form of the wave function is so set up in Eq. (18.10) that in 
the classical limit the scattering is counted twice: once in the true scatter- 
ing (which turns out to be spherically symmetric as it is in the classical 
problem) and again in the shadow of the scattering sphere that appears 
in the forward direction, since this shadow is produced by interference 
between the incident plane wave e*** and the scattered wave f(@)e*"/r 
[see also the discussion of Eq. (19.14)]. However, so long as ka is finite, 
diffraction around the sphere in the forward direction actually takes place, 
and the total measured cross section (if the measurement can be made so 
that it includes the strong forward diffraction peak) is approximately 
27a’. 


SCATTERING BY A SQUARE WELL POTENTIAL 


As a second example of the method of partial waves, we consider the 
somewhat more complicated problem of the scattering from the spheri- 
cally symmetric square well potential illustrated in Fig. 13, Sec. 15. 
The interior (r < a) wave function that is finite at r = 0 is seen by 
analogy with Eq. (15.11) to be 


Ri(r) = Byler) oe Keel (19.25) 


Thus the phase shifts are given by Eq. (19.15), where the ratio of slope 
to value of the /th partial wave at r = a is 


= ailee) (19.26) 


In the low-energy limit (ka « 1), substitution of (15.7) into (19.15) 
gives for the first two phase shifts 


2 
tan d5 = — pute 
ans ; (19.27) 
mA a — y1a 
tan 6; 3 2+ ya vid 


1 The next term has been calculated by S. I. Rubinow and T. T. Wu, J. Appl. Phys. 
27, 1032 (1956); the result is 


0.99615 
(ka)§ 


o = Prat | + 
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Unless you = —l or y1a = —2, both of these vanish ask— 0. As with 
the rigid sphere, however, the / = 0 partial wave gives a finite contribu- 
tion to the scattering because of the factor 1/k? that appears in (19.12) 
and (19.13). From Eq. (19.26), we see that yoa = aacot aa — I, so 
that 


: 2 
o = 4na? (1 oc =) (19.28) 
aa 


The scattering is spherically symmetrical. 

The conclusion reached here and in connection with the rigid sphere . 
that the low-energy scattering is substantially independent of bombard- 
ing energy and angle of observation is almost always valid for any 
potential that has a finite range. Exceptions can arise, as pointed out 
after Eq. (19.27), if any one of the y; is such that the denominator of 
the expression for tan 6; is very small. In such a situation, the [th 
partial wave is said to be in resonance with the scattering potential; then 
it usually dominates the scattering. 


RESONANCE SCATTERING 

An approximate expression for the resonance cross section can be obtained 
by making use of the fact that y; decreases linearly with increasing a 
when «a is sufficiently close to ao = (2uVo/h?)?. Increasing a causes the 
interior wave function to bend over more rapidly and so decreases the 
ratio of slope to value at r = a. Now 


k? 
a= Ue ORT Fae 
when k is small, so that we can write to lowest order in k 
yia = y°a — bi(ka)? 


where y;° is the value of y; when a = ao, and &; is a positive number of 
order unity.! Substitution into (19.27) and then into (19.12) gives for 
the leading term in the differential cross section, in the two cases for 
which the value of / for the partial wave that is in resonance is 0 and is I, 
a? 
(Fo — bok?a?)? + (ka)? 
9a? cos? 6 (ka)4 
(¢1 — bik?a)? + (ka)® 
We have put tc = yo°a + 1 and {1 = y1°a + 2; for resonance, |to| and 
\¢1| are small compared with unity. It is easy to show then that (19.29) 


a(0) = 1=0 (19.29) 


o(8) = L=1 (19.30) 


1 It can be shown that the [th partial wave is exactly in resonance at zero bombarding 
energy when y:°2 = —(l + 1); in this case b: = } for alll. Compare with footnote 1, 
page 88. See also B. W. Downs, Am. J. Phys. 30, 248 (1962). 
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is a monotonically decreasing function of ka; however (19.30) has a sharp 
maximum at ka ~ (£1/b:)! if {1 is positive, and a much lower maximum 
at ka ~ (2|¢3|bi)* if ¢1 is negative. 

If we make use of the relation {9 = aca cot aca, we see from (19.29) 
that a suitable approximation for the total cross section when the / = 0 
partial wave is in resonance is 


dr 


(a 
k? + ay? cot? aod 


(19.31) 
It is apparent that the / = 0 partial wave is in resonance at low bombard- 
ing energies whenever aoa is approximately an odd multiple of 7/2, so 
that Voa? = 1°h?/8u, 9n?h?/8y, etc. The discussion of Eq. (15.3) shows 
that these are just the values of Voa? for which new energy levels with 
1 = Oappear. It is true quite generally that a potential well (not neces- 
sarily square) that has an energy level nearly at zero exhibits a resonance 
in the low-energy scattering of particles with the same / value (not neces- 
sarily zero) as the energy level. From a physical point of view, we can 
say that an incident particle that has nearly the right energy to be bound 
by the potential tends to concentrate there and produce a large distortion 
in the wave function and hence a large amount of scattering. 

Sharp resonance maxima in the low-energy scattering like that found 
above for / = 1 with positive (1 can appear for any / value except 1 = 0, 
provided that the potential well is not quite deep or broad enough to con- 
tain a new energy level of that angular momentum (this corresponds in 
the case of the square well to having 1 small and positive). We can think 
of such a potential physically as containing a virtual energy level slightly 
above zero. Although a discrete energy level cannot exist with positive 
energy, the positive “centrifugal potential” /(J + 1)h?/2ur? [see the dis- 
cussion of Eq. (14.18)] for 1 > 0 acts as a potential barrier that impedes 
the escape of a particle that is in the virtual energy level. Figure 24 
_ illustrates this barrier, which is characterized by a small transmission at 
low energies in the same way as is the barrier of Fig. 14 [see the discus- 


small transmission for E slightly 
greater than zero. 


Effective 
Evunal ML + 1) he 
| By oar Fig. 24 The effective potential 
\ Ps energy [V(r) plus “centrifugal 
i potential’]for! > 0,when V = 0 
0 sma! Fait id a aa for r >a. The dashed portion 
| Fs for r < a depends on the shape 
’ of V. The effective potential 
i ra barrier. (r >a) resembles the 
| ao barrier of Fig. 14 in that it has a 
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sion of Eq. (17.7)]. Thus the virtual level has a kind of transient exist- 
ence and produces a greater distortion of the incident wave function at 
its energy than at neighboring energies. In the limit of zero energy, it 
can be shown that the transmission of the centrifugal barrier is zero 
(see Sec. 34), so that the particle cannot leak out if 1 > 0. Resonance 
at zero energy thus leads to a true bound state if J is not zero. 


ANGULAR DISTRIBUTION AT LOW ENERGIES 


When the bombarding energy is small but not zero, the partial wave 
1 = 1 may have an observable effect on the scattering. If only 59 and 8; 
are appreciably different from zero, Eqs. (19.12) and (19.13) become 


a(@) = a [sin? 9 + 6 sin do sin 51 cos (61 — 59) cos @ 
+ 9 sin? 6; cos? 6] (19.32) 


¢ = a (sin? 59 + 3 sin? 61) 


In the absence of resonance, we see from Eqs. (19.27) and (19.32) that 
the ratio of the contributions to the total cross section of the partial waves 
l = landl = Oisof order (ka)*. However, the ratio of the largest angle- 
dependent term in the differential cross section (which is proportional to 
cos 9) to the constant term is of order (ka)?. 

Thus the partial wave 1 = 1 manifests itself in the angular distribu- 
tion at a lower energy than that at which it becomes significant in the 
total cross section; this is because of its interference with the stronger 
partial wave! = 0. For example, if 59 = 20° and 6; = 2° at a particular 
bombarding energy, the partial wave / = 1 contributes only 3 percent to 
the total cross section but it makes the forward scattering (9 = 0°) 3.5 
times as great as the backward scattering (@ = 180°). 


20C1SCATTERING BY COMPLEX POTENTIALS 


It has been assumed thus far that the potential energy V(r) isreal. This 
is a natural assumption since V has been taken over from classical 
mechanics in accordance with Eq. (6.14). However, there are situations 
in which it is convenient to make use of a complex potential as a highly 
abbreviated description of a much more complicated physical system. 
For example, neutrons may be scattered elastically or inelastically from 
atomic nuclei. In the first case the nucleus is left in its original state, 
and the sum of the kinetic energies of neutron and nucleus is the same 
before and after the collision. In an inelastic collision the nucleus is 
left in an excited state, and the quantity Q, defined in connection with 
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Eq. (18.5), is negative. Inelastic scattering and the absorption of the 
neutron to form a different nucleus are both processes in which neutrons 
are taken out of the incident beam and do not reappear as part of the 
scattering amplitude f(6,¢). 

So far as the effect of these other processes on the elastic scattering 
is concerned, a useful approximation consists in regarding the interaction 
between neutron and nucleus as being represented by a complex potential.! 
It is usually called the optical-model potential, in analogy with the long- 
standing use of a complex refractive index for discussing the optical 
properties of partially absorbing media. This section describes the 
modifications in the theory thus far developed that are necessitated by 
the assumption that V may be complex. 


CONSERVATION OF PROBABILITY 


It was shown in Sec. 7 that the position probability density P(r,é) and 
the probability current density S(r,t), defined by Eqs. (7.1) and (7.3), 
obey a differential conservation relation. The derivation of this relation 
made essential use of the assumption that V is real. If we now assume 
that V is complex and write it for future convenience in the form 


V = Vr —i1Vz 


where Vez and V, are real, a repetition of the derivation shows that the 
conservation relation is replaced by 


oP(r,t) 2Vr 


ar ee =e h 


P(r,t) (20.1) 
Since P is nonnegative, the right side of this equation acts as a source 
of probability if Vr is negative and as a sink if Vz is positive. For the 
physical reasons discussed above, we shall always assume that V; > 0. 
If there is a region in which V-S = 0, Eq. (20.1) shows that P decays 
exponentially in time, P « e—?¥1/*, so that the wave function is expected 
to have the time dependence py « e“#-Vi#Ah, 

Integration of Eq. (20.1) over a fixed volume 2 bounded by the 
surface A leads to 


0 2 
< I P(r,t) dr = — i SidA — 5 I, ViP dr (20.2) 


in place of Eq. (7.4). If y-is sufficiently well bounded at large distances 
so that the surface integral vanishes when © is the entire space, we see 


1 It was first used for this purpose by H. Feshbach, C. E. Porter, and V. F. Weisskopf, 
Phys. Rev. 96, 448 (1954). For reviews of the subject, see P. E. Hodgson, ‘The 
Optical Model of Elastic Scattering’? (Oxford, New York, 1963); A. L. Fetter and 
K. M. Watson in ‘‘Advances in Theoretical Physics’? (Academic, New York, 1965). 
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that the normalization integral decreases in time. This is to be expected 
since particles are being absorbed. 

It is more interesting to apply Eq. (20.2) to a scattering situation. 
In this case we require y to be stationary, so that it has the time depend- 
ence y « e £4, The discussion of Eq. (20.1) then shows that there is 
no region in which V; # Oand V-S = 0. The left side of (20.2) is now 


zero, and as ® becomes large the first term on the right side, — i , S,daA, 


becomes the total inward flux of particles entering 2 from very large 
distances. In the discussion of Eq. (19.6) it was pointed out that the 
vanishing of this integral implied that the particles scattered outward 
are supplied by the incident plane wave. In the present situation we 
see that this integral is the flux of particles removed from the incident 
plus elastically scattered wave by the complex potential, provided that 
y is normalized to a plane wave of unit amplitude at infinity. It may 
therefore be set equal to voays, where v is the relative velocity and op, is 
the sum of the total absorption and inelastic cross sections. We thus 
obtain the relation 


2 
ave = 5 f Vilp|? dr (20.3) 


Although this equation is valid whenever y is properly normalized, it is 
useful only if y is known in the region where V; # 0. 


COMPLEX PHASE SHIFTS 


If the potential is spherically symmetric, the method of partial waves 
can be employed whether or not V is complex. The discussion of Eq. 
(19.5) showed that the phase shifts are real if the potential is. The 
same argument shows that 6; is generally complex if V is complex, and 
so we put 


6: = ar + 28: 
where a; and 8; are real, and also define 
S1, = e281 = erar—28, 


The derivation of the expression for the scattering amplitude is unchanged, 
so that Eq. (19.11) becomes 


f(0) = (2ik)— ) (21 + 1)(S; — 1)Pi(cos 8) (20.4) 
1=0 


The differential elastic scattering cross section is |f(6)|?, and the total 
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elastic cross section is 


Oe = 20 L |f(6)|? sin 6 d@ 


= a 2, (21 + 1)(1 — 2e-%8 cos 2a; + e~461) 


= iB 2 (21 + 1)|1 — S|? (20.5) 


This reduces to Eq. (19.13) when 8; = 0, as expected. 
The absorption cross section can also be expressed in terms of the 


phase shifts. As pointed out above, it is equal to —v-! I, S, dA, where 
it is now convenient to choose the surface A to be a large sphere of radius 
r: 


ee Pia r? [ [F S, sin 0 do de 


Vv p30 


_ lim 2z7r? I " §, sin 6 d@ (20.6) 


To 


The radial flux is 


_ hf ,du du* 
8 = gn (u ar a) 


Differentiation of Eq. (19.1) gives 
Ou _ d= 1 =X! P,(cos 6) + o(Z ) 


or 
1=0 


so that Eq. (20.6) becomes 


_ Qxt . 1 9X1 dx? 
Tabs = Kk, L2b+ 1 + 1 (xt or or x) (20.7) 


evaluated at r = ©. The asymptotic form (19.8) shows that 


ai [sin (kr — lr + a1 — 781) cos (kr — dlr + a1 + 781) 
— cos (kr — 1a + a — 78,) sin (kr — dla + a1 + 78))] 
— |All? af sin (—278,) = — dad sinh 28, 
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Substitution of this and the expression |A,|? = (21 + 1)%e-*: into Eq. 
(20.7) yields 


Tabs = BD, (21+ 1) — e*) 


=5 > +00 — |S) (20.8) 
1=0 


We thus expect that 6; > 0. 
The total cross section is defined to be the sum of the elastic and 
absorption cross sections. From Eqs. (20.5) and (20.8) it is 


or X 
Ctot — Ww (21 + 11 — €-% cos 2a1) 
k? 2 , 


= a Y t+ fl — Re (8) , (20.9) 
1=0 


where Re denotes the real part of what follows. 


ASYMPTOTIC RELATIONS 


Three interesting and useful relations, the reciprocity theorem, the 
generalized optical theorem, and the optical theorem, can be derived 
from a consideration of the asymptotic behavior of the scattering wave 
function. We now obtain them in the order just given, although this 
is not the historical sequence.! 

It is convenient to rewrite Eq. (18.10), with A = 1, in a form that 
does not emphasize the special role played by the z axis: 


u(r) <> exp (ak +r) + 7~'f(k,,k)e*" (20.10) 
Here the plane wave is incident along the direction of k, and the amplitude 
f(k,,k) corresponds to scattering from k to k,, which is a vector that has 


the magnitude k and the direction of r. For another solution that corre- 
sponds to the initial direction —k’, . 


U_yr(r) —> exp (—7k’ +r) + rf (k,, —Ee*” (20.11) 
Since u(r) and u_y-(r) both satisfy Eq. (18.8) with the same value of E, 
it follows that 

Unt V2Uy — UyV?2u_y = 0 (20.12) 
whether V is real or complex. | 


1 The optical theorem [Eq. (20.23) below] was first derived in this way by E. Feenberg, 
Phys. Rev. 40, 40 (1932). This approach was subsequently gencralized to yield the 
other two theorems by R. Glauber and V. Schomaker, Phys. Rev. 89, 667 (1953). 
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Equation (20.12) may be integrated over a large spherical volume 
Q of radius r, and this integral can be transformed into an integral over 
the surface of the sphere: 


i (U_yrV UX = UV u_x) der 
2x fx OUux Ou_y . 
= 72 oak = 
r ie is (u or Uk ae ) sin 6, d6, do, = O 


where @,, ¢, are the polar angles of r with respect to some arbitrarily 
chosen polar axis. Substitution of the asymptotic forms (20.10) and 
(20.11) gives in the limit of large r 


r? a i {ik(cos 6 + cos 6” )e**r(cos 6 cos 6”) 


+ [Fa + cose” - Fs | Petrone 
tk 1 os 4 : 
— |— (1 — cos @) — S| f(k,, —k’)e*r+008 } gin 9, do, dd, = 0 
r r? 


where @ is the angle between k and r, and 6’ is that between k’ and r. 
The first term in the integrand can be seen to vanish on integration over 
the directions of r, since for every pair of angles 6, 6’ there is a canceling 
pair 7 — 6’, — 6. The 1/r? terms in the integrand, as well as other 
terms of order 1/r? that were ignored in the expansion (20.10), can be 
neglected for a reason that will become clear from the discussion of (20.14) 
below. 
We are thus left with the equation 


ihr [O" [F (CL + cos 6’) (kc) ert con 09 
— (1 — cos 6)f(k,, —k’)e*r(1+ 008] sin 6, d6,d¢, = 0 (20.13) 
which is to be evaluated in the limit r— ©. In the first term we can 
shift the polar axis from its original arbitrary choice to the direction k’ 


and thus replace the variables of integration 6,, ¢, by 6’, ¢’. Then, with 
w = cos 6’, we consider the integral 


r ie F (w,¢’)e*r—») dw (20.14) 


in the limit of large r, where F(w,¢’) = zk(1 + w)f(k,,k). Integration 
by parts gives 


a 1\ pikr(1—w) oare ah oF skr(1—w) 
prime Je 2S een” dw 


It is apparent that further partial integrations lead to successively higher 
powers of r in the denominator, so that only the first term need be kept 


CONTINUOUS EIGENVALUES: COLLISION THEORY 135 


in the limit r—> 0. This is the reason why the above-mentioned terms 
of order 1/r? could be neglected. We thus find that the limit of (20.14) 
is 


t [F(L,¢!) — F(-L¢'e] oe 


With the above form for F(w,¢’), we see that F(—1,¢’) = 0. How- 
ever the second term of (20.15) could have been dropped in any event, 
since the factor e2*" oscillates about zero as T—> ©. This is an applica- 
tion of the method of stationary phase, which is sometimes useful in 
obtaining asymptotic expressions for integrals.’ In this case the point 
w = 1is called the stationary phase point, since the phase of the integrand 
of (20.14) is stationary as w approaches unity. 


RECIPROCITY THEOREM 


The stationary phase point of the first term in (20.13) is cos 6’ = 1 or 
6’ = 0. This corresponds to r being along k’ or to the substitution of 
k’ for k, in the scattering amplitude f(k,,k). Further, the integrand does 
not depend on ¢’, so that the ¢’ integration only introduces a multiplica- 
tive factor 27. In similar fashion, the second term in (20.13) has its 
stationary phase point at cos @ = —1 or @ = , so that k, is replaced by 
—k in f(k,, —k’). 
We thus obtain the reciprocity theorem 


f(k’,k) = f(—k,—-k’) (20.16) 


which states that the amplitude for scattering from k to k’ is equal to 
that for scattering from the reversed final direction —k’ to the reversed 
initial direction —k. It is important to note that this theorem is valid 
whether V is real or complex. 

It often happens that V(r) = V(—r), so that V possesses space- 
inversion symmetry (see Sec. 29). Then all vectors in the scattering — 
problem may be inverted without altering the physical situation, and 
we have that 


f(—k,-k’) = f@&K) (20.17) 
Combination of Eqs. (20.16) and (20.17) yields a slightly specialized 
reciprocity theorem: 

f(e'k) = fk’) (20.18) 


GENERALIZED OPTICAL THEOREM 


The procedure followed in the two preceding subsections can now be 
repeated, starting with uy in place of uy. Then Eq. (20.12) is replaced 


1 See, for example, C. Eckart, Rev. Mod. Phys. 20, 399 (1948). 


136 QUANTUM MECHANICS 
by 
47, 
urV2u, — UV2up + a Vrupur = 0 


Integration over a large spherical volume @ of radius r leads to 


x a 
rt fr i (ut3 xe oe _ us vk) sin 6, d6, do, + + Hf Viubun dr =) 


Substitution of asymptotic forms oe (20. 10) in the first term then gives 
in the limit of large r 


r? I e i. . {ik(cos 6 + cos es e008 F 
ite E (1 + cos 6’) — J | 1th) 6") 


+|2 (1 + cos @) + 3 |7¢6, J et 008 0) 


4 2k 2 7) sin 6, d6, dd, 


4 Su ch [ Viutu dr = 0 x 19) 


where the volume integral is now over all space. 

As before, the first term integrates to zero, and the terms that are 
products of 1/r? and exponentials can again be neglected. The two 
remaining exponential terms can be evaluated by the method of stationary 
phase. The first term has its stationary phase point at the same place: 
cos & = 1 ork, = k’. However, the second term now has its stationary 
phase point at cos 9 = 1 or k, = k, rather than at k, = —k as before; 
there is also a sign change that arises from the altered sign in the exponent. 
Equation (20.19) thus becomes 


—4af(k’ jk) + 4af*(k,k’) ++ Quik i. a i * *(k,,k’)f(k,,k) sin 6, d0, d¢, 

+ SH [ Viubued'r = 0 (20.20) 

As it stands, this relation is not useful, since the last term is difficult 

to evaluate and in general has-no simple physical interpretation. We 

thus consider first the case in which V is real, so that Vr = 0. Equation 
(20.20) is then called the generalized optical theorem: 


LP" ff $0.7 ek) sin 6, a0, dey = PE Dp*(e,e’) — fe’ k)] 
(20.21) 
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If V possesses inversion symmetry, Eq. (20.18) may be used to simplify 
this relation: 


i [5 P°Qe Afr) sin 6, d0, dd, = 2 Im [f(k,k)] (20.22) 


It is important to note that Eqs. (20.21) and (20.22) are valid only if V 
is real. 

Equation (20.22) may be used in conjunction with the Born approx- 
imation (see Sec. 38) in the following way. The Born approximation 
gives an extremely simple approximate expression for the scattering 
amplitude f which is of first order in V, and this f is real when V is real. 
Thus Eq. (20.22) enables us to calculate the imaginary part of f that is 
of lowest order in V and shows that it is actually of second order. 


OPTICAL THEOREM 


There is one situation in which the volume integral in Eq. (20.20) has a 
simple physical interpretation, that is, when k’ = k. Then, in accord- 
ance with (20.3), 


fViuguy dr = Zhv0 ar. 


since the normalization of u, agrees with that used earlier for y. Then 
Eq. (20.20) becomes 


ie cd 2 4 
ii I |f(k,,k) |? sin 6, d0, dor + abs = = Im [f(k,k)] 


The first term on the left side is the total elastic cross section o., so that 
the left side is equal to oot. 
We thus obtain the optical theorem: 


fgg Spar anp es = Im [f(k,k)] (20.23) 
This is a generalization of Eq. (19.14) in that it applies as well when V 
is neither spherically symmetric nor real. In actuality, the optical 
theorem is even more generally valid than the preceding sentence indi- 
cates, since it can be derived without reference to a potential. All that 
is needed is the asymptotic form of the probability current density 
S(r,t), obtained by substituting (20.10) into (7.3). Then the integral 
of its radial component over an infinite sphere is proportional to the 
rate of disappearance of particles from the plane plus scattered wave 
and hence is proportional to oa».. Equation (20.23) then follows at once 
from application of the method of stationary phase. The optical theorem 
is also valid when special relativity is taken into account. 

It is apparent that f(k,k) enters into the optical theorem through 
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an interference between the plane and scattered waves in (20.10). This 
interference is characterized by the phase factor e**r(!-s®) | which, as a 
function of 6, oscillates rapidly about zero for large r except in the neigh- 
borhood of 6=0. Thus the asymptotic probability current density 
consists of a part from the plane wave alone that is everywhere in the 
direction of k, a part from the scattered wave alone that is always radially 
outward and integrates to o.1, and a part that arises from interference. 
The last is appreciable only when kr(1 — cos 6) < 1; for large kr and 
small 6, this is equivalent to 6 < (kr)-?. Within this angular region 
the scattered wave interferes destructively with the plane wave, thus 
removing from the plane wave just enough of the incident flux to account 
for absorption and inelastic scattering and for elastic scattering at all 
but infinitesimal angles. 

In the asymptotic region, this angular range of interference may be 
thought of as the remnant of the shadow cast by the scatterer. This 
shadow remnant has a diameter d ~ r(kr)-? ~ (r/k)*. The true shadow 
cannot extend beyond a distance r, such that the interfering or imaginary 
part of the forward-scattered amplitude, Im[f(k,k)]/r., is of the order of 
the plane wave amplitude, which is unity, so that r, ~ Im[f(k,k)] ~ kotot. 
At this limiting distance, the shadow diameter is d ~ (r./k)* ~ (tot)!. 
Now otot is of the order of the square of the diameter of the scatterer in 
the short wavelength or classical limit. Thus scatterer and shadow are 
roughly of the same size at the limiting distance at which the shadow 
exists, as would be expected.! 


21LISCATTERING BY A COULOMB FIELD 


It was noted in Sec. 19 that the coulomb field is an exceptional scatterer 
so far as the application of the method of partial waves is concerned. 
If V(r) = ZZ’e*/r for a collision between particles of charges Ze and Z’e, 
it is easily seen that Eq. (19.3) becomes asymptotically 


xi (1) «x gtt(kr—n la r) (21.1) 


Here n = uZZ'e?/h2k = ZZ'e*/hv, where v is the relative velocity and pis 
the reduced mass. Thus the radial solutions never approach the sinus- 
oidal free-particle solutions, since there is always a logarithmic contribu- 
tion to the phase at great distances that cannot be neglected. Although 
it is still possible to obtain a solution of this scattering problem in spheri- 
cal coordinates (this is given below), the phase shifts 6, introduced in 
Sec. 19 are altered in meaning. This section presents the analytical 


1 For an extension of the discussion in this paragraph and its relation to the uncertainty 
principle, see L. I. Schiff, Prog. Theoret. Phys. (Kyoto) 11, 288 (1954). 
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work in outline and quotes the principal results from more extended 
treatments. ?! 


PARABOLIC COORDINATES 


So long as o(@) for a pure coulomb field is all that is desired, it is simpler 
to work with the separation of the wave equation in parabolic coordinates 
(see Sec. 16) than in spherical coordinates. The reason for this is that 
the desired solution depends almost entirely on the variable ~ defined in 
Eq. (16.25) and not on the other two variables 7 and ¢. It is apparent 
that the solution will not involve ¢, because of the axial symmetry of the 
problem; if now the incident plane wave term e is taken out as a factor, 
it can be made plausible that the rest of the solution does not involve 7. 
We put 


Ue = ef . (21.2) 


where u, represents the complete coulomb wave function (incident plus 
scattered wave). Now u, must contain a part whose dominant asymp- 
totic behavior is of the form r~'e*’, but no part that goes like r—'e** 
[see Eq. (18.10)]. Since an expression ef(r — z) can be of this form, 
whereas an expression e*#f(r + z) cannot, we anticipate that the function 
f appearing in (21.2) will depend only on & = r — z. 

We substitute Eq. (21.2) into Eq. (16.26), after replacing Z by 
—ZZ', and remember that E > 0. The differential equation for f is 
then 


ee + (1 — iké) a — nkf = 0 (21.3) 


The confluent hypergeometric equation 


ad°F 


dF 
2a2 + 6-27 ar = 0 (21.4) 


which has the solution F(a,b,z), is equivalent to Eq. (21.3) if we put 
where C is a constant. 


1 W. Gordon, Z. Physik 48, 180 (1928); N. F. Mott and H.S. W. Massey, ‘““The Theory 
of Atomic Collisions,’’ 2d ed., chap. III (Oxford, New York, 1949). For the mathe- 
matical background, see E. T. Whittaker and G. N. Watson, “‘A Course of Modern 
Analysis,” 4th ed., chap. X VI (Cambridge, London, 1935). 
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CONFLUENT HYPERGEOMETRIC FUNCTION 


The solution of Eq. (21.4) that is regular at z = 0 can be written as a 
power series: 


T(a + s)V(b)z* 
X, ra) + s)rd + s) 


ae a(a + 1)z? 
b1! © b(6 + 1)2! 


F(a,b,z) = 


I 


1+ Hecate (21.6) 
It is convenient to put F(a,b,z) = Wi(a,b,z) + W2(a,b,z), where Wi and 
W, are separately solutions of Eq. (21.4). An asymptotie expansion for 
F can then be obtained from the following relations: 


W(a,b,z) = Sw (6) a) (—z)-*9(a,a — 6 + 1, —2z) 


W.(a,b,z) = ie : e7z2-g(1 — a, b — a, z) (21.7) 


a(a + 1)6(6 + 1) 


g(a,B,z) ae 1 +o A+ 222! 


+ . 


The solution of Eq. (21.4) that is irregular at the origin can be taken to 
be 


G(a,b,z) = *W,(a,b,z) — 1W.(a,b,z) (21.8) 
We shall require the irregular solution for problems in which the coulomb 
field does not extend in to r = 0. 

The asymptotic form of the coulomb wave function can be obtained 


from Eqs. (21.1), (21.5), and (21.7). The result through terms of order 
ris 
Coin n? 
i[kz+n ln k(r—z)) ee SS 
cme Td + in) f E a ik(r — 5 | 
+ r—1f,(0) etar—n In | (21.9) 
where! . 
- T(1 + in) e7inin (sin246) 
F.(6) = ar(— in) 2k sin? 16 


n (21.10) 


~ 2k sin? 26 
no = arg T'(1 + in) 


1 The real quantity ¢ = arg z is defined by the relation z = |z| e*%. 


e7in in (sin?40)+tx+ 279 
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SCATTERING CROSS SECTION AND NORMALIZATION 


The f. term on the right side of Eq. (21.9) represents the outgoing 
scattered wave, since it is the only term in which the factor 7~'e*" appears. 
The first term in Eq. (21.9) similarly corresponds to the incident ‘‘plane”’ 
wave; the multiplicative factor —n?/zk(r — z) can be ignored in the 
asymptotic region. Both the incident and scattered waves are distorted 
-at infinite distances by logarithmic phase factors. In accordance with 
Eq. (18.11), the differential scattering cross section is 


7.(0) = |f-(0)|? = (aarp) 


12 
= (#5) cosec! 46 (21.11) 


This is just the formula obtained by Rutherford from classical dynamics, 
which was verified experimentally for the collisions of alpha particles 
(helium nuclei) with heavier nuclei. It should be noted, however, that 
the angle-dependent part of the phase factor in the scattered amplitude 
f-(@) given in Eq. (21.10) can manifest itself in a nonclassical way when 
the colliding particles are identical (see Sec. 40). 

If the incident beam is normalized to unit flux, the constant C 
must be chosen to be 


C = vo T(1 + inje“* 
so that the coulomb wave function is 
vT (1 + inet oF (—in,1,iké) 
= IT (1 + inet cos $F (— in, 1, Qkr sin? $6) (21.12) 


ll 


Ue 


Then the particle density at r = 0 is found from the power series expan- 
sion (21.6) to be 


[we(O)|? = |C]? = o|T(1 + in)|Pe-"* 


2n3r 


For small collision speeds (|n| >> 1), Eq. (21.18) tells us that 
Jw.(0)|? = 2rin attractive case, n < 0 
(21.14) 
2 2Tn 4. : 
lwe(O)|? = ere repulsive case, n > 0 


The second of Eqs. (21.14) is of some practical interest. The expo- 
nential is the dominant factor in the rate of reactions between positively 
charged nuclei at low bombarding energies, when the nuclear radii may 
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be assumed small enough so that the colliding nuclei have to tunnel 
through the coulomb potential barrier to zero distance in order to initiate 
a reaction. In this case e~?*22’e'/*» ig called the Gamow factor! and is the 
dominant term in the rate of many nuclear reactions at low bombarding 
energies. 


SOLUTION IN SPHERICAL COORDINATES 


In nuclear collision problems, such as that of the scattering of protons 
of several million electron-volts energy in hydrogen, the departures from 
the coulomb law of interaction at small distances between the colliding 
particles can affect the scattering cross section. Such problems can be 
treated by a modification of the method of partial waves, developed in 
Sec. 19, in which an expansion in spherical harmonics is made for the 
pure coulomb field, and modifications introduced for the first few / values. 
In order to apply such a technique, we require first a solution for the 
pure coulomb scattering in spherical partial waves. 
We put 


= ») Ri(r)Pi(cos 6) (21.15) 
t=0 


where the radial wave equation is 
14 (#% 2nk Ul + >| m= 0 


r? dr dr r 


If we substitute #,(r) = re**f,(r), the equation for f; becomes 


+ | - (21.16) 


rch + [2ikr + 20 + 1)) 2! + [2k + 1) — anki =O (21.17) 


This is ade to the confluent hypergeometric equation (21.4) and 
has as its solution that is regular at r = 0 


fi(r) = C.F(L+ 1 + in, 2l + 2, —2tkr) (21.18) 


The asymptotic form of (21.18) can be found from (21.7) and gives for 
the radial wave function at great distances 


Crete*+a0'(21 + 2) 


Rit) =? GTC + 1 + in)ir 


sin (kr — 3lr — nln 2kr + m) 


(21.19) 

where m = arg I'(i + 1 + in). 
The coefficients C; must be determined so that the partial wave 
expansion (21.15) is identical with the solution (21.12) in parabolic 


1G. Gamow, Z. Physik 61, 204 (1928); R. W. Gurney and E. U. Condon, Phys. Rev. 
33, 127 (1929). 
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coordinates. Because of the orthogonality of the neereors polynomials, 
we have the relation 


Ris) = 2 Et [f" Pilcos é)ue(r,6) sin 0 d0 (21.20) 


where u-(7,@) is given by the second of Eqs. (21.12). The complete 
evaluation of this integral can be avoided by making use of the fact that 
we know all about the function R,(r) except the constant multiplying 
factor Ci. \C; can then be found by matching the known form of R,(r) 
to Eq. (21.20) near r = 0; it is 


_ (2ck)'e*"T(L + 1 + in) 
vi (21)! 
We thus obtain as an alternative expression to (21.12) 
= o-te-ine > a ed (2ikr)te*r 
a, TC) 
FU + 1+ in, 21+ 2, —2¢kr)Pi(cos 6) (21.21) 


It is also possible to expand f.(@), given by Eq. (21.10), in a series 
of Legendre polynomials; the result is 


f.(6) = zit), (21 + 1)e*"P,(cos 6) (21.22) 


This seems surprising at first, since f.(@) does not appear to vanish in the 
limit n — 0, which corresponds to no interaction; we obtain instead 


: _ 1 
lim f-(0) = ak (21 + 1)P,(cos 6) 
The series on the right can be summed to give 
> (21 + 1)Pi(cos 6) = 48(1 — cos 8) (21.23) 
1=0 


Equation (21.23) can be verified by multiplying both sides by Py(cos 8), 
integrating sin @d@ from 0 to z, and remembering that the integral on 
the right is carried to but not through the point 6 = 0. We thus find 
that f.(@) does indeed vanish in the limit n— 0 for 6 ~ 0 but that it 
has a 6-function singularity at @ = 0 such that 

lim h ” £,(0) sin 6d = — ; (21.24) 
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for any finite 6. Equation (21.24) is easily verified by direct integration 
of (21.10), provided that the value of the integral at the lower limit is 
interpreted to be the average of its value at « as e— 0. This peculiar 
behavior arises from the long-range character of the coulomb field. No 
matter how weak the interaction is made, it always causes particles that 
pass at very great distances to be deflected through very small angles. 


MODIFIED COULOMB FIELD 


If the actual potential deviates from the coulomb form only at small 
values of r, we expect in analogy with the partial wave treatment of 
Sec. 19 that only the first few terms in the sum (21.21) will be altered. 
Since each partial radial wave function must be a solution of (21.16) 
outside of the potential anomaly, the only change we can make in the 
function f; and still have it a solution of (21.17) is to add in some of the 
irregular solution G(J + 1+ in, 21 + 2, —2zikr) defined by (21.8). The 
manner in which G is to be added in is determined by the requirement 
that the complete wave function shall represent asymptotically a coulomb 
incident plus scattered wave, plus an extra outgoing scattered wave. 

We must, therefore, substitute for each F term in (21.21) a linear 
combination of F and G in which the amount of the ingoing term W2 is 
not changed. Such a combination is 


e%.(F cos 6: + Gsin 6) = Wye + We 


The modified wave function, which is a solution of the wave equation 
outside of the potential anomaly, can then be written 


Ape rd + 1 + in) 
— 4 nr Be ce ca 
Um = Ue + v-te (an! 


(ce — 1)Wi(l + 1 + in, 21 + 2, —2ikr)Pi(cos @) (21.25) 
The asymptotic form of Un is 


(2ikr)'e** 


Um — > vt Y) (21 + Adel» (kr)! 
l=0 


sin (kr — dlr — n1n 2kr + m+ 6)Pi(cos 6) (21.26) 


As shown in connection with Eq. (19.5), if V is real each term on the 
right side of (21.26) must be a real function of r, except for complex 
multiplying factors, so that the 6, must be real. — 

The additional phase shifts 5, can be found by matching each partial 
radial wave in Eq. (21.25) to the interior solution at the edge of the 
potential anomaly, in just the same way as the phase shifts were found 
in Sec. 19. Although in Sec. 19 the phase shifts 5 represented the 
departure of the wave function from that of a free particle, they here 
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represent the departure from the wave function of a particle scattered 
by a pure coulomb field.! It can be shown from Eq. (21.25) that the 
asymptotic form of u, may be written in the form of (21.9), where f.(6) 
is replaced by 


fa(8) = f.(8) + ») k-1(21 + Le#@r) sin 8,P:(cos 8) (21.27) 
t=0 


The differential scattering cross section is just |f,.(6)|? and in general 
contains interference terms between the coulomb scattered amplitude 
f-(@) and the extra terms that are determined by the 6;. 

It is important to note that these 6, are not equal to those that 
would be obtained from the potential anomaly in the absence of the 
coulomb field. 


CLASSICAL LIMIT FOR A PURE COULOMB FIELD 


As was discussed in Sec. 12, we expect the results of quantum and classical 
theory to coincide whenever it is possible to construct wave packets that 
follow the classical trajectory without spreading appreciably and are small 
enough so that the forces are sensibly constant over their dimensions. 
The smallest spread of a wave packet during a time interval ¢ was found 
there to be of order (it/u)* or (Ad/pyv)* = (Ad)!, where d = vt is the dis- 
tance traveled by the packet in the time ¢, and A = A/2mr = h/yv is the 
reduced wavelength for the relative motion. Thus the classical theory 
can be used when (Ad)? < d, or (d/X)? > 1, where d is the distance over 
which the force varies by an appreciable fraction of itself. For a repulsive 
coulomb field, d is of the order of the classical distance of closest approach 
|ZZ'e?/iuv|. This also provides a useful estimate for an attractive cou- 
lomb field, since in all the collisions, except the relatively few for which 
the particles are scattered through large angles, they never get closer than 
this distance from each other. 
The condition for the validity of the classical theory is then 


Large n implies that the angle-dependent part of the phase of f.(@) given 
by Eq. (21.10) varies rapidly with 0, so that these rapid oscillations in 
the scattering amplitude should have little effect on the scattering when 
the colliding particles are identical (see Prob. 6, Chap. 10). 

It is interesting to note that for the coulomb field the classical limit 
is approached for small v, whereas for potentials that have a finite range 
1 The computation of the 6; in Eq. (21.25) requires knowledge of G at small r; useful 


formulas have been given by F. L. Yost, J. A. Wheeler, and G. Breit, Phys. Rev. 49, 
174 (1936). 
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a, such as are discussed in Sec. 19, the classical limit is approached when 
(a/x)* > 1, that is, for large v. This is because the “‘size” |ZZ’e?/pv?| of 
the coulomb field increases more rapidly than A = %/yv as v decreases. 


PROBLEMS 


1. Show that the coefficients of scattering by a one-dimensional square well poten- 
tial (like Fig. 14 except that Vo < 0) are given by Eqs. (17.5) if the sign of Vo is 
changed there and in the expression for a. Discuss the dependence of transmission 
coefficient on £ in this case. 


2. Show that Eqs. (18.4) and (18.7) are valid for a general binary collision if y is 
given by (18.5); make use of conservation of energy and mass. 


3. Show that, when a particle of mass m, collides elastically with a particle of mass 
mz that is initially at rest, all the recoil (mass mz) particles are scattered in the forward 
hemisphere in the laboratory coordinate system. If the angular distribution is 
spherically symmetrical in the center-of-mass system, what is it for mz in the labora- 
tory system? 

4. Express the scattering wave function (19.1) outside the scattering potential (but 
not necessarily in the asymptotic region) as the sum of a plane wave and an infinite 
series of spherical Hankel functions of the first kind [see Eqs. (15.12)]. From this 
expression and the discussion of Eqs. (15.13), show that the scattered wave is purely 
outgoing, even inside of the asymptotic region. 

5. Two objects, A and B, have irregular forms and are identical in size and shape. 
Object A is filled with a positively infinite (impenetrable) potential and scatters 
particles of mass m. Object B is made of metal and has a certain electrostatic 
capacity C when isolated from other objects. Derive expressions for the differential 
and total cross sections for elastic scattering of particles of mass m by object A in the 
limit k — 0, in terms of the capacity C of object B. 


6. Find a general expression for the phase shift produced by a scattering potential 
V(r) = A/r*, where A > 0. Is the total cross section finite? If not, does the diverg- 
ence come from small or large scattering angles, and why? What modifications are 
necessary in the calculation if A <0? Are any difficulties encountered in the 
latter case? 


7. What must Voa? be for a three-dimensional square well potential in order that 
the scattering cross section be zero at zero bombarding energy (Ramsauer-Townsend 
effect)? Find the leading term in the expressions for the differential and total cross 
sections for small bombarding energy. Show that partial waves with / > 2 can be 
neglected. 


8. State clearly the assumptions that go into the derivation of Eq. (19.31), and 
verify that it is a suitable approximation for the total cross section at low bombarding 
energies when the / = 0 wave is in resonance. 

9. Make use of Eq. (19.31) and the result of Prob. 5, Chap. 4, to obtain an approxi- 
mate expression for the total scattering cross section by a particular potential in terms 
of the bombarding energy E and the binding energy « of a particle in that potential, 
when FE and « are small in comparison with Vo. 

10. Compute and make a polar plot of the differential scattering cross section for a 
perfectly rigid sphere when ka = i, using the first three partial waves (J = 0,1,2). 
What is the total cross section in this case, and what is the approximate accuracy of 
this result when the three terms are used? 
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11. Calculate the elastic scattering and absorption cross sections for a potential of 
the form V(r) = —V,(1 + 7), r<a; V(r) =0, r>a. Vo and ¢ are positive; 
assume that & and ¢ are very small and that there is neither a resonance nor a Ram- 
sauer-Townsend effect. Keep only the leading term in k and ¢ in each cross section. 


12. Show that the following inequality holds for any potential: 


ei(0) > (=e) 


The quantity on the left side is the differential elastic scattering cross section in the 
forward direction. 


13. Protons of 200,000-ev energy are scattered from aluminum. The directly back- 
scattered intensity (@ = 180°) is found to be 96 percent of that computed from the 
Rutherford formula. Assume this to be due to a modification of the coulomb poten- 
tial that is of sufficiently short range so that only the phase shift for 1 = 0 is affected. 
Is this modification attractive or repulsive? Find the sign and magnitude of the 
change in phase shift for 1 = 0 produced by the modification. 


6 


Matrix Formulation of 
Quantum Mechanics 


In the four preceding chapters the Schrédinger wave equation was 
developed and its solutions obtained in some cases of physical interest. 
We now turn to a different formulation of quantum mechanics, in which 
dynamical variables such as the coordinates, momentum components, 
and energy of a particle appear explicitly in the equations of motion of 
the system without their having to multiply or differentiate a wave func- 
tion. The classical equations are just of this structure, and so it might 
be expected that there would be a closer resemblance between the classical 
and quantum formalism here than in the Schrédinger theory. 

This is actually the case; the principal formal difference is that the 
quantum-dynamical variables do not obey the commutative law of 
multiplication. It is convenient to represent such noncommutative 
dynamical variables, which are often simply called operators, as matrices. 
Matrix theory provides an especially flexible representation scheme, since 
there are an arbitrarily large number of ways of choosing the rows and 
columns of a matrix, all of which are of equal validity. It is because of 
the close formal resemblance between quantum matrix mechanics and 
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classical dynamics that this was historically the first formulation of 
quantum theory to be discovered, by Heisenberg in 1925." 

In this chapter we first review briefly the more important properties 
of matrices and then show their connection with quantum theory and 
their usefulness in particular problems. 


22L)MATRIX ALGEBRA 


We restrict our discussion at first to matrices that have a finite number 
of rows and columns, and then indicate how the results obtained can 
be taken over to matrices that have an infinite number of rows and 
columns.? 


MATRIX ADDITION AND MULTIPLICATION 


A matriz is a square or rectangular array of numbers that can be added 
to or multiplied into another matrix according to certain rules. We 
denote a matrix by a capital letter, such as A, and the numbers or 
elements that make it up by the same letter with subscripts, such as 
Ay; here, k designates the row and 7 the column in which the matrix 
element A; appears. Two matrices can be added when they have the 
same rank, i.e., the same number of rows and the same number of columns. 
Addition is commutative: 


A+B=B+A (22.1) 
If the sum matrix is called C, then 
Cu = An + Bu (22.2) 


A matrix A can be multiplied from the left into a matrix B if the 
number of columns of A is equal to the number of rows of B; then the 


product matrix C has the number of rows of A and the number of columns 
of B. | . 


C= AB Ca = > AzmBint (22.3) 


where the summation is over the subscript m, which denotes the columns 
of A and the rows of B. It follows at once from Eqs. (22.2) and (22.3) 


1'W. Heisenberg, Z. Physik 88, 879 (1925); M. Born, W. Heisenberg, and P. Jordan, 
Z. Phys. 86, 557 (1925). The connection between quantum matrix mechanics and 
the wave equation was established by E. Schrédinger, Ann. Physik 79, 734 (1926), 
and C. Eckart, Phys. Rev. 28, 711 (1926). The historical development has been. 
reviewed by S.-I. Tomonaga, “Quantum Mechanics,’ chap. 5 (Interscience, New 
York, 1962). 

? For a fuller discussion, see J. von Neumann, ‘“‘Mathematische Grundlagen der 
Quantenmechanik,” chap. II (Springer, Berlin, 1932; reprinted by Dover, New York). 


150 QUANTUM MECHANICS 
that the distributive law of multiplication is valid. 

A(B+C) = AB+AC (22.4) 
Also, the associative law of multiplication is seen to be valid: 

A(BC) = (AB)C (22.5) 


where the left side means that A is multiplied from the left into the 
product of B and C, and the right side means that the product of A and B 
is multiplied from the left into C. The product (22.5) is written simply 
as ABC and, from (22.3), has the explicit expression 


D = ABC Du = > AjmBmnCat (22.6) 


mn 


It is clear from Eq. (22.3) that AB is not, in general, equal to BA; 
thus the commutative law of multiplication is not generally valid. 


NULL, UNIT, AND CONSTANT MATRICES 


For an arbitrary square matrix A, the null matrix © is defined by the 
equations 


OA =0 AO=0 | (22.7) 


from which it follows that all the elements of O are zero. If A is not 
square, the elements of © are still all zero, but the O’s that appear at 
different places in (22.7) do not all have the same numbers of rows and 
columns. 

The unit matriz 1 is defined by 


1A4=A  B1=B (22.8) 


for arbitrary matrices A and B. From Eqs. (22.8) it follows that 1 is a 
square matrix whose rank (number of rows or columns) equals the num- 
ber of rows of A or the number of columns of B. Moreover, 1 must have 
unit elements along its principal diagonal (k = 1) and zeros elsewhere, so 
that the elements of 1 equal the Kronecker symbol 6, introduced in 
Sec. 10. 

The product of a number c and a matrix A is the matrix cA that 
results from multiplying each element of A by c. Thus if we define a 
constant matriz C to be a multiple of a unit matrix so that each non- 
vanishing element is c instead of unity, then 


cA =CA where Cy: = céu (22.9) 


are the matrix elements of the constant matrix C. 
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TRACE, DETERMINANT, AND INVERSE OF A MATRIX 
The trace of a matrix, often called the spur or the diagonal sum, is the 
sum of the diagonal elements of the matrix: 


tr (A) = > An (22.10) 
k 


It is easily shown, with the help of (22.3), that the trace possesses the 
cyclic property 


tr (ABC --- EF) = tr (BC :-- EFA) (22.11) 


The determinant of a square matrix, denoted by det (A), is found 
from the usual rule for the computation of the determinant of a square 
array of numbers. If A and B are square and of the same rank, then 


det (AB) = det (A) - det (B) (22.12) 


A matrix may or may not possess an inverse A-'!, which is defined 
by the relations 


AA“ =1 AA =1 (22.13) 


A is said to be nonsingular if it possesses an inverse, and singular if it 
does not. If A is nonsingular and of finite rank, it can be shown to be 
square, and either of Eqs. (22.13) implies the other (see Prob. 2). The 
kl element of A~' is just the cofactor of A, divided by det (A); thus A is 
singular if its determinant vanishes. It is readily verified that for non- 
singular matrices A, B, C 


(ABC)! = C-1B-14-1 (22.14) 


HERMITIAN AND UNITARY MATRICES 


The hermitian adjoint At of a matrix A is the matrix obtained by inter- 
changing rows and columns and taking the complex conjugate of each 
element; thus if 


B= A! then Bu = Aj, (22.15) 


It is readily verified that the hermitian adjoint of the product of a series 
of matrices is the product of their adjoints in the reverse order: 


(ABC)t = CtBtAt (22.16) 


A matrix is hermitian or self-adjoint if it is equal to its hermitian 
adjoint; thus H is a hermitian matrix if 


H = Ht (22.17) 


Evidently only square matrices can be hermitian. 
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A matrix is unitary if its hermitian adjoint is equal to its inverse; 
thus U is a unitary matrix if 


Ut = U-!} or if UUt=1 and UtU =1 (22.18) 


Unitary matrices of finite rank must be square, and then either of the 
latter two of Eqs. (22.18) implies the other. 


TRANSFORMATION AND DIAGONALIZATION OF MATRICES 


We define the transformation of a square matrix A into a square matrix 
A’ by a nonsingular matrix S, by the following equation: 


SAS-! = A’ (22.19) 


It is evident then that S~! transforms A’ back into A. 
The form of a matrix equation is unaffected by transformation. 
Thus the equation 


AB+CDE=F 
may be transformed into 

SABS—! + SCDES—! = SFS™ 
which is equivalent to 

SAS-!- SBS“! + SCS-!- SDS“! - SES“! = SFS-! 
or to 

A'’B’+C'D'E' = F' 
where the primes denote transformed matrices. This invariance of 
matrix equations with respect to transformations makes it possible to 
work with any convenient transformation of a set of matrices without 
affecting the validity of any results obtained. It should also be noted 
that tr (A) and det (A) are unchanged by transformation. 

A square matrix is diagonal if it has nonvanishing elements only 
along the principal diagonal (k = 1). The diagonal elements are then 
called the ezgenvalues of the matrix. It is easily seen that the nth power 
of a diagonal matrix is also diagonal and has as its eigenvalues the nth 
powers of the eigenvalues of the original matrix. The matrix A in Eq. 
(22.19) is said to be diagonalized by the matrix S if the matrix A’ that 
results from the transformation is diagonal, so that A,, = A,d.. To 


find A’ explicitly, we multiply (22.19) through on the right by S, where 
we assume that such an S exists. 


SA = A'S (22.20) 


The set of linear algebraic equations that are obtained by writing the 
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elements of Eq. (22.20) for a particular row k and all columns ! is 
»> SimAmi = A, Si or ») Sim(Amt = A, bmi) = 0 (22.21) 


where A, is a particular eigenvalue of A’ and the subscript m is summed 
over from unity to the rank N of the matrix A. 

Now (22.21) may be regarded as a set of N homogeneous algebraic 
equations for the transformation matrix elements Sim, where k is fixed. 
The necessary and sufficient condition that these equations have a solu- 
tion is that the determinant of their coefficients vanish, or that the deter- 
minant of the square matrix (Am: — A; dni) be zero. This provides a 
single algebraic equation, called the secular equation, which is of order 
N and has N roots A,. Thus the eigenvalues of the diagonal matrix A’ 
resulting from A by transformation are the same no matter how A is 
diagonalized, except perhaps for the order in which they are arranged; 
for this reason they are also called the eigenvalues of the original non- 
diagonal matrix A. A’ and A are said to be degenerate when two or 
more eigenvalues are equal. 

It is not difficult to see that, in general, equations that involve both 
A and At are not invariant with respect to transformation unless the 
transformation matrix S is unitary. 


FUNCTIONS OF MATRICES 


A function of a matrix can often be defined in terms of a power series. 
Thus, if f(z) is a polynomial in the number z, then f(A) is the same poly- 
nomial in the matrix A. With an infinite series, some attention must 
be paid to convergence. It is apparent that, if A can be diagonalized 
and is put in its diagonal form A’ = SAS—', then S will also transform 
f(A) into the diagonal form f(A’) and the kth eigenvalue of f(A’) is 
simply f(A;,). Thus, if the power series f(z) converges when z is equal to 
each eigenvalue of A, the power series S-'f(A’)S = f(A) is well defined. 
This last may provide a useful definition of f(A) even when f(z) cannot 
be represented by a power series. 

Exponential and trigonometric functions of matrices can be repre- 
sented by power series that converge for all finite values of their argu- 
ments and hence are well defined. It is easily seen, for example, that 
(e4)t = e4t so that e*# is unitary if H is hermitian. 

Another useful result is the relation (see Prob. 3) 


det (e4) = et (22.22) 
MATRICES OF INFINITE RANK 


The rules (22.2) and (22.3) for addition and multiplication of matrices 
may be taken over in an obvious way for matrices that have an infinite 
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number of rows and columns, provided that the infinite sum in (22.3) 
converges. We sometimes deal with matrices that have a nondenu- 
merably infinite number of rows or columns or both; in this case, one or 
both of the matrix subscripts becomes a continuous variable, and the 
usual summation over a subscript is replaced by integration over this 
variable. We do not consider these possibilities in detail here but simply 
assume that all reasonable results can be taken over from finite- to 
infinite-rank matrices without difficulty.1 The statement that a her- 
mitian matrix of infinite rank is square means that its rows and columns 
are labeled in the same way. A unitary matrix of infinite rank need 
not be square. Its rows and columns can be labeled differently; for 
example, the number of rows may be denumerably infinite and the 
number of columns nondenumerably infinite. 

We are concerned primarily in quantum mechanics with hermitian 
and unitary matrices, mainly of infinite rank. A fundamental theorem 
that we shall assume without proof is that any hermitian matrix can be 
diagonalized by a unitary transformation; a corollary of this theorem is 
that the resulting eigenvalues of the hermitian matrix are unique, 
except perhaps for the order in which they are arranged. Starting from 
this theorem, it is not difficult to show (see Prob. 1) that the necessary 
and sufficient condition that two hermitian matrices can be diagonalized 
by the same unitary transformation is that they commute (matrices A 
and B commute if AB = BA). 

It also follows from this theorem that the eigenvalues of a hermitian 
matrix are real. If the S and A in (22.19) are unitary and hermitian, 
respectively, that equation can be rewritten 


SASt = A’ (22.23) 
The hermitian adjoint of Eq. (22.23) is, from (22.16), 
SASt = A’t 


Since then A’t = A’, this shows that the hermitian property is 
maintained during transformation by a unitary matrix. If A’ is diagonal 
as well as hermitian, it follows from (22.15) that its eigenvalues are real. 
It is easily seen that the converse is also true: a matrix that can be 
diagonalized by a unitary transformation and has real eigenvalues is 
hermitian. 

It is important to note with matrices of infinite rank that both 
of Eqs. (22.13) must be valid in order for A-! to be the inverse of A. 
Similarly, both of the latter pair of Eqs. (22.18) must be valid if U is 
to be unitary. 


1A more thorough discussion of this point and a proof of the following theorem are 
given by von Neumann, loc. cit. 
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23 TRANSFORMATION THEORY 


The appearance of matrices in quantum mechanics can be connected in 
a simple way with the solution of the Schrédinger equation (8.2). In 
this section we adopt the hamiltonian notation and justify it in detail 
in Sec. 24. We rewrite Eq. (8.2) as 


Hu, (r) = Exyux (r) (23. 1) 


where the subscript k denotes the different members of the complete 
orthonormal set of energy eigenfunctions u,(r) and their corresponding 
eigenvalues E,. k specifies the energy and also distinguishes between 
degenerate eigenfunctions; it thus includes both E and s of Eq. (10.7). 
For example, k represents k,, k,, and k, in the case of plane waves and 
Nz, Ny, and n, in the case of three-dimensional harmonic-oscillator 
eigenfunctions. 
The hamiltonian or energy operator H is given by 


pela yers 2 eve) (23.2) 
~ Im ~ Im ‘ 


where we shall assume throughout this chapter that V is real, so that the 

E;, are also real. In accordance with the discussion of Sec. 8, k may be 

a discrete or a continuous variable or (as in the case of a hydrogen atom) 

discrete over part of its range and continuous over the rest. We shall 

use the generalized summation symbol S or S, to denote both a summa- 

tion bY over discrete values of the subscript k and an integration {dk over 
k 


the continuous part of its range. We shall also use the subscript ‘‘op”’ 
whenever it seems desirable to distinguish between an operator like 
(23.2) and its equivalent matrix. 


UNITARY MATRIX W 


Suppose that we have a second complete orthonormal set of functions 
v,(r), which are eigenfunctions of some operator © with real eigenvalues 
Wy: 


Qv, (1) = wv, (r) (23.3) 


Q might, for example, be the momentum operator (6.13) with eigen- 
functions (11.4) or (11.11), or the hamiltonian for some potential other 
than that of Eq. (23.2). The v, can be expanded in terms of the u, with 
expansion coefficients w,,: 


v,(r) = Se wetter (Fr) ; (23.4) 
Both sides of this equation may be multiplied by u(r) and integrated 
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to obtain 

Win = Suf(r)o,(r) dr (23.5) 
The completeness of the set u, was used to obtain Eq. (23.4), and its 
orthonormality to obtain Eq. (23.5). In similar fashion, u, may be 
expanded in terms of the »,: 

ur(t) = Sywko,(r) (23.6) 

We can now show that the matrix W, of which u,, are the elements, 
is unitary by calculating a typical matrix element of WW: 

(WW')a = Sywevwe, = Sufug (x)o(x) dr for (r’)ur(e’) d*r’ 

= ffuf(r) 68(r — r’)ur(r’) d?r d3r’ = fux(r)uc(r) d’r (23.7) 

Use has been made here of the closure property of any complete ortho- 
normal set of functions, discussed in connection with Eq. (10.11), which 
now takes the form 

Swa(rox(r’) = a3(r — r’) (23.8) 


The last integral on the right side of Eq. (23.7) is either a Kronecker 6 
symbol or a Dirac 6 function according as k is one of a discrete or contin- 
uous set of subscripts. In either case, this integral is equivalent to an 
element of the unit matrix, so that WWt = 1. In similar fashion, it can 
be established that 


(WtW) = Sew Wer = (1), (23.9) 


TRANSFORMATION OF THE HAMILTONIAN WITH W 


We have shown that W is a unitary matrix that relates two sets of func- 
tions u, and v,. It can also be used to transform the hamiltonian matrix 
H', which we define through its matrix elements 


Hi, = Sug (rt) Hopur(r) dr = Bybx or E,6(k ma l) (23.10) 
where H,, is the differential operator (23.2). Use has been made here 
of Eq. (23.1) to show that H’ is in diagonal form with diagonal elements 


that are just the energy eigenvalues E,. Another hamiltonian matrix 
H" can be constructed by using the functions v,: 


Hi, = fo*(r)Hopv,(r) dir (23.11) 


This is not in diagonal form. 
We now follow the definition (22.19) and transform H” with W: 


(WH'W')n = S.S+WewH We 
= SS, fur (r’)o,(r’) dr’ 
Soe) Hopp (2) d*r for (r’)ur(e’) d3r”’ 
= fffuk(r’)a8’ — r) Hopi tr — r’’)ur(r’’) d?r! d?r d3r”’ 
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where H., operates only on the variable r to its right. The integration 
over r’’ may be performed by taking H., outside of the r” integral: — 


SHop B(r — x”) u(r”) d?r’ = Hop f(r — 1’) w(t”) dr” = Hopur(r) 
We thus obtain 
(WH'W')n = Sux (t)Hopu(r) dr = Hy, 


. This may be written without explicit use of subscripts as the matrix 
equation 


WH'Wt = H! | (23.12) 


Thus we see that W is the unitary matrix that transforms the non- 
diagonal matrix H” into the diagonal form H’; also, W' transforms H’ 
into H”. 

Another way of putting what has been done thus far is to state that 
the process of solving the Schrédinger equation (23.1) is completely 
equivalent to the process of diagonalizing a particular matrix. The 
matrix H’’ may be calculated from the operator H,, of Eq. (23.2) by 
using any complete orthonormal set of functions v,(r) that is convenient. 
If this diagonalization can be carried through, both the diagonal form 
H’, which consists of the energy eigenvalues E;, and the diagonalizing 
matrix W are obtained in the process. Once W is known, the eigen- 
functions u(r) may be obtained by using Eq. (23.6). 

It is interesting to note that W is not necessarily a square matrix; 
that is, its rows and columns may be labeled differently. For example, 
the functions u, may be the eigenfunctions of a three-dimensional har- 
monic oscillator, which form a completely discrete set, and the functions 
v, may be momentum eigenfunctions, which form a completely contin- 
uous set. However the hamiltonian matrices H’ and H” are both square. 

It is apparent that H’ and H” are hermitian, since their eigenvalues 
are real. The discussion of Sec. 10 developed the interpretation that the 
eigenvalues of any operator that represents a physically measureable 
quantity are real, since they are the only possible results of precise 
measurement of that quantity. Thus a dynamical variable that is 
physically measurable is represented by a hermitian matrix when it is 
in diagonal form; it then follows from the discussion of Eq. (22.23) that 
any unitary transformation of this matrix is also hermitian. 


TRANSFORMATION OF THE HAMILTONIAN WITH U 


There is no need to think of the operator H given in Eq. (23.2) as having 
a fundamentally different status from the matrices H’ and H” defined 
by Eqs. (23.10) and (23.11). The natural matrix form for H,, is one in 
which the rows and columns are labeled by coordinates r and r’. The 
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corresponding unit matrix is taken to be 
(1) = 8%(r — 1’) 


in analogy with the right side of Eq. (23.7) when k is one of a continuous 
set of subscripts. A diagonal matrix can then be written as f(r) 63(r — r’), 
where f is any multiplicative function of r but not a differential operator; 
it might, for example, be the potential energy. We then expect the 
matrix form of the hamiltonian operator to be 


Hey = Hop 53(t — r’) (23.13) 


where on the right side H,, operates only on the variable r. We shall 
refer to this matrix as H whenever the subscripts are not written explicitly. 
It is important to note that H,, is not diagonal, since H,, is not a multi- 
plicative function but contains differential operators in its kinetic-energy 
part. Equation (23.13) has nonvanishing matrix elements that are 
infinitesimally removed from the diagonal r = r’. 

Equation (23.12) shows that W transforms H” into H’. It is of 
interest to inquire what the unitary matrix U is that transforms H, as 
defined by (23.13), into H’: 


UHUt = H’ (23.14) 


This equation may be written HUt = UtH’, which becomes, when 
expressed in matrix elements, (HU*t),, = (U'tH’),,, or 


SrA (U') ee, = SU(U) aA (23.15) 


Here Sy is the same as fd'r’. On substitution from Eq. (23.13), the left 
side of (23.15) becomes 


JHop F(t — 2’)(U) py Br! = Hopf dt — 2’) (Ul) ey, a'r’ 


op rk 


where H., operates on the r dependence of (Ut),,. The right side of 
Kq. (23.15) may be rewritten by making use of (23.10), so that it becomes 
(U*),, Ey. Thus Eq. (23.15) is equivalent to 


Hop ( UO") = E,( U") 


Comparison with the Schrédinger equation (23.1) shows that (Ut),, must 
be identified with the energy eigenfunction u,(r). 

We are thus led naturally to an extension of the formalism according 
to which a complete orthonormal set of functions u(r) may be regarded 
as a unitary matrix U: 


Use = up (2) (23.16) 
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In this particular case, U transforms H into H’ in accordance with Eq. 
(23.14). It is easily verified that U is, in fact, unitary; for example, 


(UU) = SU ee(Ut) a = Sug (e)u(r) d'r = (L)er 
by orthonormality, and 
(OU) i = Se(Ut) ee U nee = Sete (r) uz (r’) = (1) rx 


by closure. 


TRANSFORMATION OF THE HAMILTONIAN WITH V 


The set of functions »,(r) can, in accordance with Eq. (23.16), also be 
regarded as a unitary matrix V: 


Ver = OM (r) 


where again the unitarity of V follows from the orthonormality and 
closure of the set »v,(r). As would be expected, transformation of H 
with V gives H”: 


VHVt = RH” (23.17) 


The yy matrix elements of the two sides of this equation are precisely 
the two sides of Eq. (23.11). 

The matrix H can be eliminated between Eqs. (23.14) and (23.17) 
as follows: 


H =VtH"V- 

A’ = UAU' = UVtH"vVUt 
This agrees with Eq. (23.12) if we set 

W = UvVt (23.18) 
In matrix-element form this is 

(Win = SUV") en 
or 

Wen = Sug (r)o,(r) d?r 
which is the same as Eq. (23.5). Equation (23.18) can also be written 
in the forms Vt = UtW and Ut = VtW', which are the same as Eqs. 
(23.4) and (23.6), respectively. 
REPRESENTATIONS OF OPERATORS 


The three hamiltonian matrices H, H’, H” are referred to as different 
representations of the same operator H,,. Each representation is char- 
acterized by the quantity whose eigenvalues are used to label the rows 
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and columns of the matrix (r, E;,, w,, or, more concisely, r, k, w). Thus 
H, H’', and H” are the hamiltonian in the coordinate, energy, and Q 
representations, respectively. A unitary matrix whose rows and columns 
are labeled by the eigenvalues of A and B, respectively, transforms an 
operator from the B to the A representation. For example, U trans- 
forms any operator from the coordinate to the energy representation, 
and Ut from the energy to the coordinate representation. In similar 
fashion, V transforms from the coordinate to the Q representation, and 
W from the Q to the energy representation. Thus application of V 
followed by W transforms from the coordinate to the 2 representation 
and then to the energy representation; the overall effect is represented 
by the product WV, since the transformation farthest to the right acts 
first. Equation (23.18) shows that WV = U, as it should since U trans- 
forms from the coordinate to the energy representation. An operator 
is diagonal in the representation in which the rows and columns are 
labeled by the eigenvalues of that operator; thus H’ is diagonal since it 
is the hamiltonian expressed in the energy representation. 

The remarks in the preceding paragraph can be illustrated by apply- 
ing them to the operator 2. In the coordinate representation it is 
expected to have the form 


QO, = Ap 5(r — 1’) (23.19) 


where on the right side 2,, operates only on the variable r. We call 
this matrix © and find its transformation into the energy representation, 
which should be UQUt: 


(UQU*) at = SiSer Ue Qee (UD en | 
= Sug (t)Q.pu(r) dr 
as expected. Similarly, transformation to the Q representation gives 
(VOV"') yy = oy(L) ur 
which is in diagonal form. 
A USEFUL IDENTITY 
Equation (23.19) can be used to write 
Qopa(t) = [Qrepe(r’) d?r’ 
where ya(r) is an arbitrary function. Then 
SVZ(t)[Qopha(t)] dr = SJYEO)Qure(e’) d?r dir’ (23.20) 


where y,.(r) is another arbitrary function. We are not necessarily assum- 
ing that the y.(r) are members of a complete orthonormal set of functions. 
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Since Q,,- is a matrix with rows and columns labeled by r and r’ and hence 
is not the same as the operator Q.», we can rearrange terms on the right 
side of Eq. (23.20) to give 


STS QM rWalr) dr] *pa(r’) d?r’ 


where (0),, = 0% is the r’,r element of the matrix Qt that is the hermitian 
adjoint of &. Equation (23.19) can also be used to define the hermitian 
adjoint operator 0.,' in accordance with 


* DoptWalt) = SOM) Wale’) dr’ 
Thus Eq. (23.20) may be rewritten 
J¥aF)[Qopa(x)] dr = f[Qop'Palr)]*¥a(r) d?r (23.21) 


Equation (23.21) is sometimes useful, since an integral of this type 
may be more conveniently evaluated when ®, operates on one of the 
functions ~a, sp rather than on the other. The partial integrations in 
Eqs. (7.9), (7.10), and (12.3) are examples of the application of Eq. 
(23.21). In those cases, 2, is a differential operator, and its coordinate 
representation involves derivatives of 6°(r — r’). However, 2., need not 
be of this specialized type but might have nonvanishing matrix elements 
for finiter — r’. In any event, if 2, is to represent a physically measur- 
able dynamical variable, it must be hermitian, in which case 2,,' can be 
replaced by QQ» on the right side of Eq. (23.21). 


ROW AND COLUMN MATRICES 


We have been dealing thus far with square matrices like H and Q, which 
represent dynamical variables and are often hermitian, and rectangular 
matrices like U, V, and W, which transform from one representation to 
another and are unitary. A particular state a of a system can be repre- 
sented by a function y¥,(r), which may be thought of as a matrix with 
one column, in which the rows are labeled by the coordinate r; as remarked 
above, a need not label a complete set of functions but can be considered 
by itself. Since the u,(r) are a complete set, ¥. can be expanded in terms 
of them: 


Valt) = Sedaxte(t) (23.22) 
where the coefficients a., are given by 

Ga = Juz ()Palt) dr (23.23) 
These equations may be written in matrix form as 


va = Uta, and Qa = Uda (23.24) 
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respectively, where a, is a one-column matrix whose rows are labeled by 
k. Thus, just as U transforms an operator like H from the coordinate 
to the energy representation through Eq. (23.14), it also transforms a 
state function from the coordinate to the energy representation through 
the second of Eqs. (23.24). 

In similar fashion we can write 


Va(r) = Sudanu(L) and ban = f vn (t)¥a(r) d*r 
in matrix form as 
Va = Vibe and ba = Vda (23.25) 


Combination of Eqs. (23.22), (23.23), and (23.18) gives 
Qa = Up. = UVtb, = Wha 


so that W transforms the state function from the 2 representation ba to 
the energy representation da, as expected. 

An important property of unitary transformations is that they 
leave the norm of a state function unchanged. In the coordinate repre- 
sentation, the norm is just the normalization integral, which may be 
written as 


S¥2(t)va(r) dr = Si (e)ve(r) = valYa 


The right side is the matrix product of the one-row matrix ¥,', which is 
the hermitian adjoint of y. and has its columns labeled by the coordinate 
r, and the one-column matrix ¥2. The result is a one-row, one-column 
matrix, which is simply a number. It is evident that the norm is real 
and nonnegative. With the help of Eq. (23.24), the norm may be 
written 


Val'Wa = A.'U Utae = Aa'de 


which is equal to S;laax|?._ In similar fashion, yatv. = Batba, so that all 
three representations of the state function have the same norm. A 
state function for which the norm exists (i.e., is not infinite) may be 
normalized by dividing it by (¥.'y.)!, in which case its norm becomes 
equal to unity. 

It is interesting to note that the identity (23.21) becomes trivially 
valid when rewritten in terms of coordinate matrix representations of the 
state functions ¥. and yz: 


Valls = (Qhya) Ys 


Equation (22.16) shows that the two sides are the same. The structure 
of the matrix products is such that each side is a one-row, one-column 
matrix and hence a number. 
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HILBERT SPACE 


A geometrical picture that is often used regards a state function such as 
Wa; Ga, OF be a8 a state vector in an infinite-dimensional Hilbert space.' 
Each dimension corresponds to one of the rows of the one-column matrix 
that describes the state, and the component of the state vector along that 
axis of the Hilbert space is numerically equal to the corresponding ele- 
ment of the matrix. Different choices for the orientation of the axes in 
the Hilbert space correspond to different choices for the representation. 
For example, the energy representation corresponds to choosing axes in 
such a way that a state vector oriented along one of these axes is an 
eigenstate of the hamiltonian. More generally, the vector a. has the 
component a. along the axis that goes with the energy eigenvalue E;,. 

The norm of a state function is just the square of the length of the 
corresponding state vector in Hilbert space, where “length” is suitably 
generalized to take account of components that may be complex numbers. 
The norm is a special case of the inner product of two state vectors ¥. and 
Ws, which is defined as _ 


(Wate) = Valve = fva(t)ve(r) a'r (23.26) 


and is also a number. When a and 8 are the same state, this is equal 
to the norm. When the inner product vanishes, the state vectors are 
said to be orthogonal. The inner products of these state vectors, expressed 
in the energy and the 2 representations, are 


(da,dg) = Ga'ag and (ba;bg) = batbs 


respectively. It is apparent from Eqs. (23.24) and (23.25) that the inner 
product is independent of the representation, just as is the norm. It 
follows that a unitary transformation from one representation to another 
corresponds to a rotation of axes in the Hilbert space, without change in 
the state vectors. A particular state vector has different components 
when referred to different axes, and these constitute the different repre- 
sentations of the state. 

It also follows that operation on Wg with Q,, performs a transforma- 
tion on the state vector ¥g so that it becomes another state vector Qyg,. 
We refer to this transformation as a generalized rotation of the state vector 
in Hilbert space, the word “generalized” indicating that the rotation may 
involve stretching or contraction of the components as well as pure 
rotation; in particular, the generalized rotation usually does not conserve 
the norm of the state vector. The matrix element y,'Qys = (Wa,QW~s) is 


1R. Courant and D. Hilbert, ‘(Methods of Mathematical Physics,’’ vol. I, p. 55 
(Interscience, New York, 1953); P. R. Halmos, ‘Introduction to Hilbert Space,’’ 
chap. I (Chelsea, New York, 1957). 


164 QUANTUM MECHANICS 


then the inner product of the state vectors ¥. and Qyg. It is easily seen 
to be independent of the representation. 


DIRAC’S BRA AND KET NOTATION 


The transformation theory developed in this section can be put into 
extremely compact form by making use of a notation invented by Diraec.! 
We describe a state function or state vector, whether it be represented 
by Wa, Ge, or be, by a ket or ket vector |a), and the hermitian adjoint 
state y.', aa', or bgt by a bra or bra vector (a|. The inner product of two 
state vectors is written 


Vale = (a|8) 
and is called a bracket expression, which is a number. The first three 
and last three letters of bracket provide the names for the two kinds of 
state vectors. 


Operation on a ket vector from the left with 2 produces another 
ket vector 


2/8) = |8’) (23.27) 


and operation on a bra vector from the right with 2 produces another 
bra vector 


(a2 = ("| 


The matrix element of 2 between states a and 6 is a number and can 
be written in any of the equal forms 


Maa = Jea(e)QYe(r) d*r = f[Qty.(r)]*¥e(r) d*r 
= (Waa) = (QhPa,Wa) 
= (al6’) = (a’|B) = (a|2\8) (23.28) 
of which the first and last are the most commonly used notations. The 


matrix elements of the hermitian adjoint operator 9 are given in accord- 
ance with Eq. (22.15) by 


(Q's = Op = (B|Q"a) = (axl 2|8)* 


A particular example of Eqs. (23.27) and (23.28) is that in which & is the 
unit operator: 


(a|1|8) = (a|6) 
This equation can be used to obtain a bracket expression for the unitary 
matrix W from (23.5) by identifying ~. with uw, and wg with »,: 
Wien = (k|u) = (ulk)* (23.29) 
We are now in a position to rewrite the principal equations of this 


1P, A. M. Dirac, ‘“‘The Principles of Quantum Mechanics,’ 4th ed., sec. 6 (Oxford, 
New York, 1958). 
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section in Dirac notation. Our three representations will be specified 
by the three complete sets of kets |X), |u), and |r). Just as |k) is an energy 
eigenstate with eigenvalue E;, and |x) is an 2 eigenstate with eigenvalue 
w,, so |r) is a coordinate eigenstate with eigenvalue r (compare with Chap. 
3, Prob. 2). Then, in analogy with (23.28), Eq. (23.16) may be written 
in the forms 


(k|r) = ug (r) and (clk) = u(r) (23.30) 
Similarly, we may write 

(ult) = of (x) and (r|u) = »,(r) (23.31) 
Again, identifying ¥. with u, we obtain 

Wa(r) = (tla) ak = (ka) ban = (ula) (23.32) 


A summation convention will be employed such that any state 
symbol that appears twice in a given expression (once in the ket and once 
in the bra position) is assumed to be summed over a complete orthonormal 
set of states; thus the symbol S, will be omitted from expressions of the 
form 


See he aa (23.33) 


The rewritten equations below are numbered with primes of their original 
numbers. 


Q|n) = w,|n) or (r|Q\r’)a"|u) = @,(t|H) (23.3) 
lu) = |k) ku) or (rlu) = lke u). (23.4’) 
(ela) = (kleyXr|a) (23.5') 
\k) = |udulk) or (tlk) = (r|u)(ulk) (23.6’) 
(kluXull) = (KlrdXr|l) = ¢k|L) (23.7') 
(rlu){ulr’) = (lr’) (23.8’) 
(ulk kv) = (uly) (23.9) 
(k|H|l) = E,<k\l) (23.10’) 
(k|u ul |yXv|l) = (||) (23.12’) 
r|H|r’)r'|k) = r|l)| Ak) (23.15’) 
(ulr{r|A|r’Xr’|») = (ul |r) (23.17’) 
(k|u) = (k[r)r|u) (23.18’) 
(ria) = (r|k)(kla) (23.22’) 
(kla) = (kir)rla) (23.23’) 
(tla) = (rluXula) and (ula) = (ulr)r|a) (23.25’) 


(a|8) = (alr)r|8) (23.26’) 
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With the exception of Eqs. (23.3’) and (23.10’), which are eigenvalue 
equations for Q and H, respectively, all these equations have one property 
in common. If we assume that a repeated state symbol (k, u, or r), 
which is understood to be summed over a complete orthonormal set of 
states as in (23.33), is equivalent to the unit operator, these equations 
become trivial identities. For example, each of the two sides of Eq. 
(23.15’) becomes equal to (r|H|k), since |r’){r’| and |1)J| may be replaced 
by unit operators on the left and right sides, respectively. This replace- 
ment is evidently equivalent to the use of the closure property of any 
complete orthonormal set of functions. Conversely, unit operators can 
always be inserted, as in going from the left to the right side of (23.4’). 
The simplest proof of the correctness of these assumptions is the observa- 
tion that the removal or insertion of such repeated state symbols always 
leads to valid equations. 

Note that an insertion can also be made between two operators, 
for example, 


OP = Ok\kIT (23.34) 


PROJECTION OPERATORS 
Consider the equation 


la) = Syluule) = S,Pula) (23.35) 
where the operator P, is defined by 
P, = |u)(ul | (23.36) 


for a single state yu, that is, without the generalized summation. It is 
clear from the first of Eqs. (23.3’) that P,|a) is an eigenstate of 2 with 
eigenvalue w,. Thus an arbitrary ket |a) can be written as a generalized 
sum of eigenstates of any operator 2 through Eq. (23.35). The operator 
P, is called a projection operator, since it projects out of |a) the part that 
is a particular eigenstate of ©. 

Suppose now that @ has only two distinct eigenvalues w: and w., 
with a great deal of degeneracy. An example might be the parity oper- 
ator, with eigenvalues + 1, or a component of the spin angular momentum 
of a spin ¢ particle, with eigenvalues +4h (see the next chapter). In 
such a situation it is sometimes desirable to write any state |a) as a sum 
of two parts, each of which is an eigenstate of 2 with the eigenvalues 
wi and w, This can be accomplished by defining projection operators 
through the equations 


i= Spank s (23.37) 
with 7 = 1, 2, instead of through (23.36). Then P,|a) is an eigenstate 
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of & with eigenvalue w;. It is easily verified that 


Pi = Q = wr and P,= Q = or (23.38) 


@1 — We We — W1 


and that 
> P; = 1] PP; = 6;P; (23.39) 


Equations (23.38) can be generalized to the situation in which & 
has more than two distinct eigenvalues. With the definition (23.37), we 
now have 


P= to (23.40) 
mo 
and Eqs. (23.39) are still valid. 

Since © is hermitian, its eigenvalues w, are real, and the P, and P; 
are evidently hermitian. Finally, it is easy to see that tr (P,) = 1 if |x) 
is a normalized eigenstate that corresponds to a discrete eigenvalue. In 
this case tr (P,) is equal to the number of linearly independent eigen- 
states of 2 that have eigenvalue w,; hence it is equal to the dimensionality 
of the Hilbert subspace that corresponds to «. 


We shall have occasion to discuss projection operators further in 
Sec. 42. 


PHYSICAL MEANING OF MATRIX ELEMENTS 


We have seen that the diagonal matrix elements of an operator that is in 
diagonal form are just the eigenvalues of that operator. Comparison with 
the work of Sec. 7 shows that, if the operator is not in diagonal form, the 
diagonal matrix elements are the expectation values of operators for 
states that are normalized. More generally, (a|Q|a)/{a|«) is the expecta- 
tion value of the operator for the state represented by the normalizable 
ket |a). 

Thus far we have no simple interpretation for off-diagonal matrix 
elements such as (a|2|8). We shall see in Chap. 8 that, among other 
things, they are related to the probabilities for transitions between pairs 
of states. . 


24C EQUATIONS OF MOTION 


The transformation theory developed in the preceding section makes no 
reference to dynamics; it deals with the description of a physical system 
at one instant of time. This description involves three kinds of objects. 
First, there are those which specify the state of the system at a particular 
time. These may be written as state functions ¥,(r); as column matrices 
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Wa, Ge, or bg; as Hilbert space vectors; or as kets |a). To each of these 
there corresponds a hermitian adjoint quantity such ast or (a|. Second, 
there are unitary transformations, which may be written as rectangular 
matrices such as U, the elements of which are bracket expressions (k|r). 
These effect changes from one representation to another. From the 
Hilbert-space point of view, they rotate axes without changing the state 
vectors, so that a particular state vector is referred to a different set of 
axes. Third, there are dynamical variables, such as 2, which may be 
written as operators or as square matrices. The elements of these 
matrices can be calculated with respect to a particular set of axes in 
Hilbert space, and a unitary matrix transforms their values from one set 
of axes or representation to another. Thus U transforms © from the 
coordinate representation (r|Q|r’) to the energy representation (k|Q(|l), just 
as it transforms |«) from the coordinate representation (r|a) to the energy 
representation (k|a). 

Alternatively, the matrix elements of 2 can be calculated with 
respect to particular state vectors ¥., which need not be members 
of a complete set. These matrix elements are the inner products 
(a2) = (a|Q\8) and are invariant with respect to unitary trans- 
formations; that is, their numerical values do not depend on the choice 
of the representation (choice of axes in Hilbert space). The result of 
operation with © on a ket |8) is a new ket 98), which is related to |8) 
through a generalized rotation of the state vector in Hilbert space. If 
[8) happens to be an eigenstate |u) of 9, then the “rotation” consists 
merely in multiplying the state vector by w, without changing its 
direction. 

A distinction must be made between rotations of axes in Hilbert 
space without change of the state vectors, which are produced by unitary 
transformations, and generalized rotations of state vectors without change 
of the axes, which are produced by dynamical variables. 

We consider in this section the way in which a physical system 
changes from one instant of time to another. As we shall see, this time 
dependence may be variously viewed as a change in the state vectors, in 
the dynamical variables, or in both. It turns out that there is one 
particular point of view that puts the equations of motion in a form very 
similar to that of the classical equations of motion in their hamiltonian 
formulation. This similarity suggests a rather general method for 
inferring quantum equations that agree in the appropriate limit with the 
classical equations for the same system. 


SCHRODINGER PICTURE 


We take as our starting point the time-dependent Schrédinger equation 
(6.16) and write it in terms of a time-dependent ket |as(t)) and the 
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hamiltonian H: - 
it © \as(t)) = Hlas(t) (24.1) 


The total time derivative is used here since the dependence of the ket on 
coordinate or other variables does not appear explicitly. The subscript 
S refers to the ket as viewed in the Schrédinger picture, according to which 
it varies in time in conformity with the ordinary differential equation 
(24.1). The hermitian adjoint equation of motion is 


— th F (as(t)| = (as()|H* = (as()|H (24.2) 


since H is hermitian. 

The solutions of the differential equations (24.1) and (24.2) are 
easily found if H is independent of the time, which we assume to be the 
case: 


las(t)) = e#*MJas(0)) — (ars(t)| = (ars(0) [ett (24.3) 


That the differential equations follow from Eqs. (24.3) may be verified 
by expanding the exponentials in power series and differentiating term 
by term. 

The operator e~*#*/* is an infinite sum of powers of H, each of which 
is a dynamical variable that can be written as an operator or a square 
matrix. Thus the series as a whole is also a dynamical variable. In 
this particular case, it is easily seen to be unitary since H is hermitian. 
However, it is essential to remember that e~‘¥*/* is not a unitary trans- 
formation in the same sense as U, V, or W. It does not transform from 
one representation to another (rotation of axes in Hilbert space); rather 
it changes the ket |as(0)) into the ket |as(t)) and so performs a generalized 
rotation of the state vector in Hilbert space. However, since e~*#*/* is 
unitary, the norm of the ket is unchanged, and the generalized rotation 
in this case is a pure rotation. 

The time rate of change of the matrix element of a dynamical vari- 
‘able Qs in the Schrédinger picture is easily found with the help of Eqs. 
(24.1) and (24.2): 


£ (as(t)|2s|Bs(t)) - E (as(b| | Qs | 6 s(t)) 


Qs E #s(0) | 


Bs()) + = (as(((QsH — Hs)|Bs(0) (24.4) 


+ (as(t | a 


Bs(t)) Ee (as(t 


= (as(t) | a 


The first term on the right side is that part of the change in the matrix 
element that arises from any explicit dependence that Qs may have on 
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the time, and the second term is that caused by the change in time of the 
state vectors. The operator that appears in this second term is called a 
commutator bracket, which is defined by 


[A,B] = AB ~ BA (24.5) 


Note that, in accordance with the definitions of Sec. 22, the commutator 
bracket has no meaning unless both matrices are square and have their 
rows and columns labeled in the same way. 

An interesting special case of Eq. (24.4) is that in which 29g commutes 
with H and has no explicit dependence on the time. Then the right side 
is zero, and all matrix elements of Qg are constant in time. Such a 
dynamical variable is said to be a constant of the motion. 


HEISENBERG PICTURE 
Substitution of Eqs. (24.3) into (24.4) gives 


g (aig(0) |e" #40. se-*##/4185(0)) = (as(0) eiHt/h as e-tHtin 


Bs (0)) 
+ 5 (as(0)|le#Ose-,H185(0)) (24.8) 


where use has been made of the fact that H commutes with e+*¥#/*, It 
is convenient to define time-independent state vectors through 


|our(t)) = |as(0)) = e*¥*/*[eg(t)) (24.7) 
and time-dependent dynamical variables through 
Qy = etHtlAQ einen . (24.8) 


Unless 2s commutes with H, Qy depends on ¢ even if Qs has no explicit 
time dependence. The subscript H denotes the Heisenberg picture. The 
time ¢ = 0 at which the kets and operators in the Schrédinger picture 


are the same as those in the Heisenberg picture is, of course, arbitrary 


and is chosen for convenience. If this time were t = to, then e+*#¢/* 
would be replaced by et*#(¢-t0)/4, 

Since |az(é)) does not depend on the time, the explicit mention of t 
may be omitted. Further, the time derivative on the left side of Eq. 
(24.6) can be taken inside the matrix element, so that the equation 
reads 


d 
(au ai Qy 
where in accordance with (24.8) we have defined 


OQy _ (2M) _ eiHtls 2s e-tHt/h 
at ot Jz at 


bx) = (an| GH | bu) + 3 (cull Hen) (24.9) 
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Since Eq. (24.9). is valid for an arbitrary bra and an arbitrary ket, it 
must be valid for the operators themselves. We are thus led to the 
equation of motion as viewed in the Heisenberg picture: 


dQx dQx 1 

cae ae + i [Qx,H] (24.10) 
This equation, which also follows directly from differentiation of Eq. 
(24.8), is the one that is most similar to the corresponding classical 
equation of motion, as we shall see below. It should be noted from Eq. 
(24.8) that H is the same in the Schrédinger and Heisenberg pictures. 

We thus see that the description of the change in time of a physical 
system may be viewed in two contrasting ways. In the Schrédinger 
picture, the dynamical variables are constant in time (except for a possible — 
explicit time dependence) and the state vectors vary in accordance with 
Eq. (24.1). In the Heisenberg picture the.state vectors are constant in 
- time and the dynamical variables vary in accordance with Eq. (24.10). 
The transformation from one picture to the other is made with the help 
of the unitary dynamical variables e+*”*/* in accordance with Eqs. (24.7) 
and (24.8). . 

We have deliberately avoided referring to these “pictures” as 
“representations,” as is sometimes done, since we wish to reserve the 
term ‘‘representation” for designation of the choice of axes in Hilbert 
space or, equivalently, the choice of the complete orthonormal set of 
functions, with respect to which the states and dynamical variables are 
specified. In this connection it is important to note that Eqs. (24.1), 
(24.7), (24.8), and (24.10) have the same form in all representations, as 
is easily shown. On the other hand, we use the term “‘transformation”’ 
in connection with the change from one picture to the other, since it is 
accomplished by means of a unitary operator. Because of this, all norms 

and inner products of state vectors are independent of the picture and 
- constant in time, as well as independent of the representation. 


__ INTERACTION PICTURE 


The discussion of the two preceding subsections shows that neither the 
“state vectors alone nor the dynamical variables alone are central‘for the 
description of the development of a physical system in time. Both can 
be altered by means of the unitary transformation e—*#*/*, and the two 
pictures thus obtained are equally valid. The quantities that are not 
altered are the matrix elements of dynamical variables calculated for 
pairs of states, such as (a|Q|8). These provide the essential physical 
content of the theory, since eigenvalues, expectation values, and transition 
probabilities can be obtained from them. It is apparent that any unitary 
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transformation, whether or not it depends on the time, may be applied to 
both states and dynamical variables, without affecting the values of the 
matrix elements. Thus an arbitrarily large number of different pictures, 
as well as of different representations, can be found. 

An especially useful third picture can be specified by dividing the 
hamiltonian into two parts, 


H = H,+ H’ (24.11) 


such that H» does not depend explicitly on the time and also has a simple 
structure. For example, Hy) might be the kinetic energy and H’ the 
potential energy. Or Ho might be the hamiltonian for a relatively simple 
potential such as the coulomb field and H’ some additional interaction 
such as an external electromagnetic field. In the latter case it is often of 
interest to permit H’ to depend explicitly on the time, and so we shall 
admit this possibility in what follows. Since H, Ho, and H’ do not in 
general commute with each other, subscripts must be used to distinguish 
between them in the various pictures. 
We define the interaction picture by the equations 


lar(t)) = e*Host/*|qs(t)) 


Oz (t) = efHost/*Q ge-tHostlh Gel?) 


so that the interaction and Heisenberg pictures are the same when 
H’ = 0. We know from earlier work that Hy = Hs, and the second of 
Eqs. (24.12) shows that Hor = Hos. In general, H; ¥ Hs, Hon ¥ Hos, 
and H',, H',, and H; are all different. Differentiation of the first of Eqs. 
(24.12) gives for the equation of motion of |a;(t)) 


. a 
th di Jar(é)) 


— Hagel |a.s(t)) + sheitonti® © Jevs(t)) 


= —Hislar(t)) + eto! AH geist |cx(£)) 
= e*Host lh FT, e—*Host /*| a7 (t)) 


= Hylar(t)) (24.13) 


where use has been made of Eqs. (24.1), (24.11), and the second of Eqs. 
(24.12). In similar fashion, differentiation of the second of Eqs. (24.12) 
gives for the equation of motion of 2;(t) 

dQr 7 dQz 1 _ 0Qr 1 

Ht Oo om Fi [Q7,Hos] = OE = i [Q7,H or] (24.14) 

What the interaction picture accomplishes is to assign part of the 
time dependence to the state vectors and part to the dynamical variables, 
instead of having one constant and the other with the time dependence 
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caused by the full hamiltonian. In this picture the state vectors change 
in accordance with H’ and the dynamical variables in accordance with 
Ho; note the resemblance between Eqs. (24.1) and (24.13) on the one 
hand, and between Eqs. (24.10) and (24.14) on the other. This is useful 
if H’ is a small disturbance or perturbation, since then the dynamical 
variables have their unperturbed forms and the state functions are nearly 
constant in time (see Sec. 35). 


ENERGY REPRESENTATION 


The energy representation is specified by the kets |k), which are eigen- 
states of the full hamiltonian H with eigenvalues E,. Equation (24.1) 
then shows that in the Schrédinger picture the time-dependent ket 
|ks(t)) satisfies the equation 


in Z [ks(t)) = H|ks(t)) = Ey|ks(t)) 


which integrates to 
lks(t)) = |kg(0))e~*2at/* (24.15) 


This is in agreement with Eq. (8.3). Equation (24.15) and its adjoint 
can be used to find the time dependence of the matrix elements in the 
Schrédinger picture of an operator that has no explicit time dependence: 


(ks(t)|@s|ls(t)) = (kg(0)|2s|lg(0))e*Ee—B0e/0 (24.16) 


It is easily verified explicitly that this is equal to the matrix element 
(ku|Qu|lx) in the Heisenberg picture, as it must be. 

Equation (24.16) shows that off-diagonal matrix elements in the 
energy representation oscillate in time with frequencies that are propor- 
tional to the energy differences between the stationary states, in accord- 
ance with Bohr’s frequency condition (see Sec. 2). This is consistent 
with the remark at the end of Sec. 23 that these matrix elements are 
related to the transition probabilities between states. It also follows 
from Eq. (24.16) that diagonal matrix elements or expectation values in 
the energy representation are constant in time for operators that have no 
explicit time dependence. 


CLASSICAL LAGRANGIAN AND HAMILTONIAN EQUATIONS OF MOTION 


In order to bring out the similarity between Eq. (24.10) and the corre- 
sponding classical equation, we review briefly the structure of classical 
hamiltonian theory. The equations of motion of a conservative dynam- 
ical system that has f degrees of freedom may be derived from a lagrangian 
function L(qi, . . - ,Q7, G1, - - - » Gy, ¢) of the coordinates q;, the velocities 
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gi: = dq;/dt, and the time, by means of a variational principle:! 


6 | Ldt =0 Sgi(ts) = dqu(te) = 0 (24.17) 
The resulting lagrangian equations are 

d {aL OL 

5 (5a) ~ ek ed GEE (24.18) 


If now we define a momentum canonically conjugate to q; as 
p: = OL /04:, and a hamiltonian function of the coordinates and momenta 
as 


f 
Hig. ** af: °° * pt) = > pi — L (24.19) 


i=1 
variation of H leads to the hamiltonian equations of motion 


gee p= — 5 ae eee (24.20) 


The time dependence of any function of the coordinates, momenta, and 
the time, calculated along a moving phase point, is 


d 
SF npr Pn =F r+) (% di + oF ps) 


_ OF oF dH  0dH oF 
-F4y (EH sn ee) 


on making use of the hamiltonian equations (24.20). The Poisson 


bracket {A,B} of any two functions of the coordinates and momenta is 
defined as 


P | 
_ a4 0B aBoA 
{A,B} = Py (= ae =) (24.21) 


In terms of the Poisson bracket, the equation of motion for the function 
F of the dynamical variables becomes 


a <5 Le 7+ AFH} | (24.22) 


1B. T. Whittaker, ‘Analytical Dynamics,” 3d ed., secs. 99, 109 (Cambridge, London, 
1927); H. C. Corben and P. Stehle, ‘‘Classical Mechanics,’”’ secs. 26, 63 (Wiley, New 
York, 1950); H. Goldstein, ‘‘Classical Mechanics,’ chaps. 2, 7 (Addison-Wesley, 
Reading, Mass., 1950); A. Katz, ‘“‘Classical Mechanics, Quantum Mechanics, Field 
Theory,” chap. 1 (Academic, New York, 1965). 
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The left side of Eq. (24.22) is the total time derivative of F along a 
moving phase point. The first term on the right side takes into account 
the explicit time dependence of F, and the last term is the change in F 
arising from the motion of the phase point at which F is evaluated. 


POISSON BRACKETS AND COMMUTATOR BRACKETS 


The strong resemblance between Eqs. (24.10) and (24.22) suggests that 
quantum analogs of the classical equations of motion can be found in 
general by substituting the commutator bracket divided by 7h for the 
Poisson bracket, 


{A,B} > 3 [A,B] | (24.23) 


and working with the Heisenberg picture. 

There are two observations that lend support to this suggestion. 
The first concerns the classical conditions for a contact transformation 
from one set of canonical variables g;, p; to another Q;, P;:! 


{Q:,P3} = 55 {9:03} = 0 {P:,P;} = 0 (24.24) 


where the Poisson brackets are calculated with respect to the original 
variables g;, p: We saw in Sec. 6 that a successful transition from 
classical to quantum theory could be made by substituting the differential 
operator —7h(d/dz) for p,, etc. The commutator of x and p, can then be 
found by letting it operate on an arbitrary function g(r) of the coordinates. 


os SS agp OU aoe Oca 
(cpz — prr)g(t) = —ihe re + ih re (xg) = thg(r) (24.25) 


Since g(r) is arbitrary, this and the other commutators may be written 
as operator equations: 


: re) 0 : 
LPz — Pot = — in ( = -%*) = th 

; FE (24.26) 
Lpy — Pyt = —th eae vie fae 
zy — yx = 0 PzPy — PyPz = 0 etc. . 


These are in agreement with the classical equations (24.24) when the 
substitution (24.23) is made. 

The second observation is that the algebraic properties of the com- 
mutator brackets are identical with those of the Poisson brackets. It is 


1 Whittaker, op. cit., pp. 300, 307; Corben and Stehle, op. cit., chaps. 11-13; Goldstein, 
op. cit., chap. 8; Katz, loc. cit. 
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readily verified from the definition (24.21) that 


{A,B} = —{B,A} {A,c} =0 where c is a number 
{(A1 + Az), B} = {AB} + (42,B} | 
{A,Ao,B} = {A1,B}A2 + A1{A2,B} 

{A,{B,C}} + {B,{C,A}} + {C,{A,B}} = 0 


(24.27) 


The order of possibly noncommuting factors has not been altered. 
Dirac! has shown that the form of the quantum analog of the Poisson 
bracket is determined by Eqs. (24.27) to be the right side of (24.23); the 
constant h is, of course, arbitrary so far as this discussion is concerned 
(see also Prob. 9). 


QUANTIZATION OF A CLASSICAL SYSTEM 


We now have available a plausible procedure for finding a quantum 
system that reduces to any specified classical system in the correspondence 
principle or classical limit. We simply write the classical hamiltonian 
equations in terms of Poisson brackets and replace the Poisson brackets 
with commutator brackets in accordance with (24.23). The classical 
equation of motion (24.22) then becomes the quantum equation (24.10), 
which states how the dynamical variables in the Heisenberg picture 
change with time. Similarly, the classical relations (24.24) become? 


(q,Pi] = thd; [gigi] = 0 [pp] = 0 (24.28) 


These quantum conditions are generalizations of Eqs. (24.26) and are 
easily seen to be valid in any of the pictures discussed earlier in this 
section. Thus the Heisenberg picture need be specified only for the 
equations of motion; we shall assume that this is the case throughout 
the remainder of this section, and the subscript H will be omitted. 

Two precautions are necessary in applying this quantization proce- 
dure. First, the coordinates and momenta must be expressed in cartesian 
coordinates. Second, ambiguities in the order of noncommuting factors 
are usually resolved by taking a symmetric average of the various possible 
orders. These precautions are illustrated in the following example. 

It is important to realize that there is no unique way of making the 
transition from classical to quantum mechanics. Terms can always be 
added to the quantum equations of motion that vanish in the classical 
or correspondence-principle limit. On the other hand, if the classical 
behavior of a system is known, certain restrictions are placed on its quan- 


1 Dirac, op. cit., sec. 21. 
2 Note that the derivation of Eq. (12.7) then shows that for any pair of canonical 
variables Agi * Api > gh. 
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tum equations. The present procedure generally yields the simplest 
quantum system that has the correct classical limit. This simplest 
system may omit interesting aspects of the true situation; thus the 
intrinsic spin and magnetic moment that some charged particles actually 
possess do not appear in the example treated below. 


MOTION OF A PARTICLE IN AN ELECTROMAGNETIC FIELD 


As an example of the foregoing quantization procedure, we consider the 
problem of the motion of a charged mass point in a general external 
electromagnetic field. The classical hamiltonian, expressed in terms of 
the canonical variables r, p and the electromagnetic potentials A(r,t), 
(r,t), is? | 


sau (2 ae y +e (24.29) 


where e is the charge on the particle and c is the speed of light; the 
electric and magnetic field strengths are given in terms of the potentials 
by 


p= —i1%_ vy H=vxA (24.30) 


The quantum conditions (24.28) in cartesian coordinates are 


[z,p=] = [y,py] = [z,p.] = th (24.31) 


with other pairs of coordinate and momentum components commuting. 
We now use Eq. (24.10), with the expression (24.29) for H and the 
relations (24.31), to calculate expressions for the particle velocity dr/dt 
and acceleration d*r/dt?, for comparison with the corresponding classical 
expressions. 


EVALUATION OF COMMUTATOR BRACKETS 
In order to facilitate evaluation of some of the commutator brackets that 
arise from substitution into (24.10), we derive a few elementary results. 
Any two functions of r commute with each other, since all components 
of r commute with each other. It follows from (24.31) that 

aps — pet? = 2(pt + th) — px? 
(pot + th)x + the — pit? = 2the 
It is readily shown by induction that 


2p: — px” = nihz™ (24.32) 


1J. H. Van Vleck, ‘The Theory of Electric and Magnetic Susceptibilities,” pp. 7, 
20 (Oxford, New York, 1932). Gaussian units are used in the present book. 
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It follows from (24.32) that for any function f(r) that can be 
expressed as a power series in x, y, 2, the relation 


[/(),pel = f(t). — pelle) = th & fer) (24.33) 


is valid.! Equation (24.33) can also be established for more general 
functions than power series by making use of the representation of pz 
as —th(0/dx), as in (24.25); if we operate with the left side of (24.33) 
on an arbitrary function g(r), we obtain 


ie),pdo(e) = —av | () 2 — 2 @ | o@ = 9@) | a2 I@ 
Ox Ox Ox 


which is equivalent to the operator equality (24.33) since g(r) is arbitrary. 
By repeated application of (24.33) it is easily shown that 


fee)pet — pasa) = in(p £4 2p.) = Zr. +e Fs 


Ox? 
(24.34) 


VELOCITY AND ACCELERATION OF A CHARGED PARTICLE 


The hamiltonian (24.29) may now be written, with the help of (24.33), 


_ Pp 
ert ame (P' A+A- P) +57 ah’ + eb 
p? e veh 
= moh p+5 = a a A t+ 5 — oa AY + ed (24.35) 


The time derivative of a component of r is then easily shown from (24.10) 
to be 


dz 1 e 
a 3 (>. -: A.) (24.36) 
in agreement with the classical relation between the velocity and momen- 


tum of a particle in the presence of an electromagnetic field. 
The calculation of a component of the acceleration of the particle 


dz 1 (% 2 das) 


dt? m\dt c dt 
1 e OA, e€ 
thm [p2,H] — me ot time [A.,H] 


1 This corresponds to the classical relation {f(r),pz} = of(r)/ox [compare with Eq. 
(24.23)]. 
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is straightforward but rather tedious. The result may be written 
de 2 (18d Bt) 4 Sel (ofa) (24) 
+ at ~ Sy) (54) | ~ aie (>. 54) Cae — 


OA, OA, _@ 
+(¥ - 2\ (e, : 4,)| (24.37) 


Equation (24.37), with the similar y- and z-component equations, can 
be written as a single vector equation for the ‘‘force’’: 


2 
mon = e(— 73 — ve) 


THE LORENTZ FORCE 


ce at 
1 
+52[5 (p24) x@ x A) — (xa) x (p—£A)| 
le (fdr dr 
= +52(G xH-HxF) (24.38) 


where use has been made of Eqs. (24.30) and (24.36). Equation (24.38) 
is In agreement with the corresponding classical expression 


eE + < (v x H) 


where v = dr/dt is the velocity of the particle, if we take a symmetric 
average of the two terms vx H and —H xv. These are identical 
classically but differ in quantum mechanics since the v given by (24.36) 
does not commute with H; also, although their sum is hermitian, the 
separate terms are not. 

Equation (24.38) includes a generalization of Ehrenfest’s theorem, 
which was discussed in Sec. 7. If we consider a diagonal element, the 
left side is the product of the mass and the second time derivative of the 
expectation value of the position vector of the particle, since the Heisen- 
berg-picture kets are independent of t. The right side is the expectation 
value of the Lorentz force acting on the charge of the particle. Thus 
(24.38) states that a wave packet moves like a classical particle if it is 
sufficiently well localized so that the electromagnetic fields change by a 
negligible amount over its dimensions. This result can also be obtained 
by the method of Sec. 7 when, in accordance with (24.1) and (24.35), 
the Schrédinger-picture wave equation is taken to be 


2,9 ( WW teh teh e? 
ha, = On boa Yt ome © AtgeaAt +eo)y 


2mc 
(24.39) 
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VIRIAL THEOREM 


A proof of the virial theorem in quantum mechanics can be given in 
analogy with the corresponding proof in classical mechanics. In the 
latter, the starting point is the time average of the time derivative of 
the quantity r- p, which is zero for a periodic system. The analogous 
quantity in quantum mechanics is the time derivative of the expectation 
value of r-p, or the diagonal matrix element of the commutator of 
r-p and A in the energy representation, which is also zero. 


d 1 

5 tp) = 3 Ue: p),Hl) = 0 

Po? + py? + pe’ 
2m 

OV OV or) 


mh 2 2 2) — »¢ pecaerite eee pee 
m (Pe + py? + pz’) in (< oe Yay toe 


[@- p),H] = | ee ap,), # Veews) | 


i 


I 


WhT — ih(r- VV) 
where 7’ is the kinetic energy. We thus conclude that 
XT) = (r+ VV) | (24.40) 


Note that it is immaterial whether we start with r- p or p-r, since the 
difference between them is a constant and hence commutes with H. 

If V is spherically symmetric and proportional to 7”, and the 
expectation values exist, Eq. (24.40) shows that 2(7T) = n(V). The 
case n = —1 is in agreement with the result of Prob. 13, Chap. 4, and 
the case n = 2 is in agreement with the results of Sec. 13. 


25LIMATRIX THEORY OF THE HARMONIC OSCILLATOR 


The linear harmonic oscillator was discussed from the point of view of 
the Schrodinger equation in Sec. 13. This system can also be treated 
by manipulating the matrix equations directly. The hamiltonian is the 
one-dimensional form of Eq. (23.2), with V(z) = 3Kz?: 


= PY 4 aKy? 

H om + 1Kzx (25.1) 
where x and 7 are hermitian since they are physically measurable dynam- 
ical variables. The quantum condition is 

xp — px = th (25.2) 


No further information is needed to obtain the energy eigenvalues and 
eigenfunctions.! 


1 Dirac, op. cit., sec. 34. 
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ENERGY REPRESENTATION 


We work at first in the energy representation, in which H is diagonal. 
Equation (25.1) may be written in Dirac notation as 


(k|H) = Exthl) = > GelplaMalolly + aK ClelsXGlel) (28.3) 


where the summation symbol S; has been omitted as in Eq. (23.33). 
Now 


ilplt) = Ulp"|s)* = Ulp|9)* 


since p is hermitian, and a similar expression holds for the matrix elements 
of x. Thus, for the diagonal elements of Eq. (25.3), the right side is a 
sum of squares of absolute values of matrix elements and hence is non- 
negative. An energy eigenvalue E; can be zero only if the matrix ele- 
ments (k|p|j) and (k|z|j) are zero for all 7. However, this would be 
inconsistent with the kth diagonal matrix element of Eq. (25.2). We 
conclude, then, that all the energy eigenvalues are positive. 

As a next step, we calculate the commutator bracket of x and of p 
with H: 


2H — Hr = my pH — Hp = —hKz (25.4) 


Equations of this type are generally useful if the representation is chosen 
so that one of the factors in the commutator is diagonal, since then a 
difference between two eigenvalues appears when a matrix element is 
taken. Thus the first of Eqs. (25.4) becomes 


(kel AALETIO) — CRLET|A)Xileld) = Br — Ex)(Rleld) = ™ Gelpldy (25.5) 
In similar fashion, the second of Eqs. (25.4) becomes 

(E, — E,)(k|p|l) = —ihK(k|z\2) (25.6) 

It follows, on elimination of (k|z|l) from Eqs. (25.5) and (25.6), that 
either (k|z|l) = (k|p|l) = 0 or 

Ej~ Ey = +h (Fy SPs (25.7) 


where, as in Sec. 13, w, is the angular frequency of the corresponding 
classical harmonic oscillator. If we were to rule out the first possibility, 
the energy eigenvalues would all differ from each other by integer multi- 
ples of hw. It is then possible to show, as will be done below, that these 
eigenvalues are equal to (n + })hw., where n is a positive integer or zero. 
If we try to exploit the first possibility in order to bring in states with 
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different energy eigenvalues, we see that the matrices that represent x 
and p are identically zero for the new states and so again are inconsistent 
with Eq. (25.2). Thus the only possible states are those whose energy 
eigenvalues differ by integer multiples of ha. 


RAISING AND LOWERING OPERATORS 


Equations (25.5) and (25.6) can also be combined in a different way by 
multiplying the first by —imw, and adding it to the second. The result 
is 

(Ei — Ex — huwc)(k|(p — tmw.x)|l) = 0 


Thus (k|(p — imw,x)|l) fails to vanish only when E, = E; — hw,. This 
means that the result of operating with p — imw. on the ket |l) is some 
multiple of the ket |k), which has an energy lower by fw,. In similar 
fashion, it is easily seen that the hermitian adjoint operator p + imw,x 
raises the energy of any state byhw.. Repeated application of the energy- 
lowering operator to any ket will ultimately lead to a difficulty, since the 
energy must be positive, unless there is a lowest energy eigenstate |0) 
such that 


(p — imw,x)|0) = 0 (25.8) 


The lowest energy eigenvalue can be found from Eq. (25.8) by oper- 
ating on it from the left with p + imw,x. The result is 


(p + imw.x)(p — imw.x)|0) = [p? + mw,2x? + imw.(xzp — px)]|0) 
= (p? + m*w,x? — mhw-)|0) = 2m(H — thw.)|0) = 0 


Thus |0) is an eigenstate of H with the eigenvalue 3hw.. Application of 
the energy-raising operator p + tmw,z to |0) gives a multiple of the ket 
|1) that has energy $fw.. Further application generates an infinite 
sequence of eigenstates that are conveniently labeled |n), with energy 
eigenvalues 


E, = (n+ dhe n=0,1,2,... (25.9) 


This is in agreement with Eq. (13.8). 

It is often convenient to put the energy-raising and -lowering oper- 
ators in dimensionless form by multiplying them with —i(2mhw,)— and 
i(2mhw.)-*, respectively. We call the raising and lowering operators 
obtained in this way at and a, respectively. It is easily seen that 

H H 1 


1 
T= oa ty = pape 
OS ee ap COs 5 


Thus a and at satisfy the commutation relation 


aat — ata = 1 (25.10) 
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and H may be written in terms of them as 
= (ata + 3)ho- (25.11) 


It follows from Eqs. (25.9) and (25.11) that the eigenvalues of ata 
are the positive integers and zero; for this reason ata is often called the 
number operator. This result is a direct consequence of the commutation 
relation (25.10) and requires no other information. The fact that the 
eigenvalues of ata, when a is any operator that satisfies Eq. (25.10), are 
the positive integers and zero is useful in connection with the quantization 
of wave fields (see Chap. 14). The operators at and a appear there ab 
initio, rather than as combinations of x and p, and have the physical 
significance of respectively increasing and decreasing the number of 
particles represented by the field. They are therefore called creation 
and destruction operators. In the same context, the number operator 
ata specifies the number of particles present in an eigenstate of the field. 


MATRICES FOR a, z, AND p 


The only nonvanishing matrix elements of a are (n — lla|n), which we 
call X,. Similarly, the only nonvanishing matrix elements of at are 
(njat|ln — 1); because of Eq. (22.13), these are equal to X\*. Then a 
diagonal matrix element of a‘a is 


(nlataln) = (nlat|n’)(n'|aln) = |rn|? 


where a summation over the complete set n’ is implied. Since this 
diagonal element is equal to 7, d» is equal to n? except for a possible 
multiplying phase factor of unit magnitude, which we can choose to be 
unity. Thus the matrices for a and a‘ are 


010 0 00 0 0 
00 2 0 10 0 0 
00 0 3: 02 0 0 
a=!|0 00 0 a’=!10 0 3 0 


(25.12) 


The product ata can be formed from these matrices; it is diagonal with 
eigenvalues 0, 1, 2, . . . , as expected. 


The expressions for a and a‘ in terms of x and 7p are readily solved 
back to give 


e2 (Cn mma) (ati a) ae i (™ee)} Ca (25.13) 
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Explicit matrices for x and p are easily found from Eqs. (25.12) and 
(25.13). Each has nonvanishing elements that border the principal 
diagonal on both sides, and they are in agreement with Eqs. (13.18). 


COORDINATE REPRESENT ATION 


The ket |0) can be found in the coordinate representation (z|0) by solving 
Kq. (25.8) in that representation. We have 


{a\(p — imwex)|\0) = (2|(p — imw.x)|2’)(x"|0) = 0 
where 

(z|p|z’) = —in d(x — 2’) and (a|r|x’) = x 6(x — x’) 
so that 


(-—th £ — imu,.xr){x|0) = 0 


This has the solution 
(x|0) = A gem (moe! 20)2* (25.14) 
where Ay is an arbitrary constant. The normalization requirement 
Olxy(xlo) = ["_ Keio)? dz = 1 


shows that Ay = (mw,./mh)* = o#/x*, in agreement with Eq. (13.19) when 
a is defined as in Eq. (13.3). 

As remarked above, repeated application of the energy-raising oper- 
ator p + imw,x to (x|0) generates all the (unnormalized) éigenstates (z|n). 
It is easily verified that for an arbitrary function g(x) 


dx th *) g(z) 


— stemostian  [e-mastiing(2)} (25.15) 


(p + tma.x)g(x) = 


| 
3: 
oom 
i: 
| 
3 
iS 


A second application of p + imw.x gives 
(p + imw.x)*g(z) = —themwe' 2 — fe~meen"/?8[ (p+ imwex)g(x)]} 
which on substitution from Eq. (25.15) becomes 


2 
( = th) 2emu_x?/2h £ [e~meez?/2h9 ()] 
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The induction to (p + imw.x)"g(x) is evident. Thus, when g(z) is chosen 
to have the form (25.14), we obtain 


dq” 
= mw .t? [2h p— mw, x7 fh 
(2|n) = Ane da © (25.16) 
where A, is a normalization constant. Equation (25.16) is in agreement 
with Eqs. (13.12) and (13.13). 


PROBLEMS 


1. Assume that any hermitian matrix can be diagonalized by a unitary matrix. 
From this, show that the necessary and sufficient condition that two her- 
mitian matrices can be diagonalized by the same unitary transformation is that they 
commute. 


2. Show that a nonsingular matrix of finite rank must be square. Also show that in 
this case the equation AA~! = 1 implies the equation A~!A = 1. 


3. Show that det (e4) = e'(4), State the conditions on A that must be assumed 
in your proof. 


4. Find two matrices A and B that satisfy the following equations: 
At=O AAt+ AtA =1 B=AitA 


where © is the null matrix and 1 is the unit matrix. Show that B? = B. Obtain 
explicit expressions for A and B in a representation in which B is diagonal, assuming 
that it is nondegenerate. Can A be diagonalized in any representation? 


5. Find three matrices A, B, and C that satisfy the following equations: 
At=Bt=CM=1 AB+ BA =BC+CB=CA+AC=0 


where 1 is the unit matrix and © is the null matrix. Obtain explicit expressions for 
all three matrices in a representation in which A is diagonal, assuming that it is non- 
degenerate. 


6. Find three matrices A, B, and C that satisfy the following equations: 
A?=B?=C?=1 BC —CB=iA 


where 1 is the unit matrix. Show that AB + BA = AC +CA =O, where O is 
the null matrix. Obtain explicit expressions for all three matrices in a representation 
in which A is diagonal, assuming that it is nondegenerate. 


7. If A and B are any two square matrices of the same rank and C = [B,A], show that 
eATB = eAgBehC 


provided that [C,A] = [C,B] = 0. 


8. If H = p?/2un + V(z) for one-dimensional motion of a particle, and V(z) can be 
expressed as a power series in z, show by purely matrix methods that 


dx op dp _ dV 
v7 


dt 


where x and p are in the Heisenberg picture. 
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9. A(z,p) and B(z,p) can be expressed as power series in x and p, and [z,p) = ih. 
Show by purely matrix methods that 


=. 4 
dm 3y [4,8] = (4,B} 


where the right side is the Poisson bracket calculated as if z and p were classical 
variables. 

10. The hamiltonian H = p?/2u + V(r) has a set of eigenkets |k) with eigenvalues 
E;. Show that, if |/) is any ket that has a discrete eigenvalue, 


h? 


Si(Ex — Ei)|(klz\1))2 = 7 


where z is a cartesian component of r. As a byproduct of your solution, give a matrix 
proof of Prob. 9, Chap. 3. 
11. The hamiltonian for the harmonic oscillator may be written in the coordinate 
representation as 

(olH|x'y = — © se _ 9) + 4Krt ae — 2’) 

~ 2m dz? 

Transform H into the momentum representation, and find the p-representation wave 
functions that correspond to the z-representation wave functions given in Eq. (13.13) 
or (25.16). 
12. Show that the matrix for z obtained from Eqs. (25.12) and (25.13) is in agreement 
with Eqs. (13.18), and that Eq. (25.16) is in agreement with Eqs. (13.12) and (13.13). 


7 
Symmetry in 
Quantum Mechanics 


Symmetry is a fundamental attribute of the natural world that enables 
an investigator to study particular aspects of physical systems by them- 
selves. For example, the assumption that space is homogeneous, or 
possesses translational symmetry, leads to the conclusion that the linear 
momentum of a closed isolated system does not change as the system 
moves. This makes it possible to study separately the motion of the 
center of mass and the internal motion of the system. In similar fashion, 
the assumption that space is isotropic, or possesses rotational symmetry, 
means that the total angular momentum of such a system is constant. 
Connections of this kind between symmetry properties and conservation 
laws have been used earlier in this book without developing an elaborate 
formalism, as in the reduction of the Schrédinger equation for the hydro- 
gen atom from seven independent variables to one (Sec. 16). However, 
a systematic treatment is useful in solving more complicated problems. 
More important, the unified view of symmetry that results provides a 
deeper insight into the structure of physics. 

In this chapter we consider first the geometrical symmetries that may 
be associated with the displacements of a physical system in space and 
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time, with its rotation and inversion in space, and with the reversal of 
the sense of progression of time. We then discuss the dynamical sym- 
metries that lead to the unexpected degeneracies of the energy levels of 
the hydrogen atom and the isotropic harmonic oscillator. Several other 
symmetries that are of interest in physics are omitted, notably those that 
relate to molecules, crystals, and relativity. Although we shall consider 
mainly a single particle, or equivalently a pair of particles in the center- 
of-mass system, many of the results obtained can be extended to several 
interacting particles provided that the symmetries apply to all coordinates 
of all particles. For identical particles there is also an additional permu- 
tation symmetry, which will be discussed in Sec. 40. 


26LISPACE AND TIME DISPLACEMENTS 


We shall work entirely in the Schrodinger picture. As our first symmetry 
operation, we consider the displacement in space of a physical system in 
a state represented by the ket |a) or the wave function y,(r). The dis- 
placement is through a vector » and changes the ket |a) into the ket |a’) 
or changes the wave function y.(r) into the wave function y~(r). This 
means that 


Ya (rt + 9) = a(t) (26.1) 


As an example of (26.1), we note that, if ¥.(r) is a wave packet that has 
its maximum value when its argument r is equal to ro, then y.-(r) is a 
wave packet of the same shape that has its maximum value when its 
argument r is equal toro + 9. Thus ¥ has its maximum displaced by 
e with respect to the maximum of wz. 

‘What we have just described is usually called the active point of 
view, in which the coordinate system is unchanged and the state function 
is displaced. The completely equivalent passive point of view consists 
in leaving the state function unchanged but referring it to a coordinate 
system that is displaced by the vector — with respect to the original 
coordinate system. For the situations discussed in this book, the active 
point of view seems to be more appropriate physically, and we shall use 
it consistently. However, it should be noted that most treatments of 
homogeneous Lorentz transformations make use of the passive point of 
view, which seems more suitable when relative motion (rather than rela- 
tive displacement) is being considered; the same physical system is then 
viewed by observers who are moving with respect to each other. 


UNITARY DISPLACEMENT OPERATOR 


Two kinds of transformations were discussed in Secs. 23 and 24. The 
first kind, of which U, V, and W are examples, are unitary transformations 
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from one representation to another that rotate axes in Hilbert space with- 
out change in the state vectors. The second kind, of which Q and e—*#+#/4 
are examples, are dynamical variables that produce generalized rotations 
of the state vectors in Hilbert space without change in the axes. Wenow 
wish to find a transformation of the second kind that changes the ket 
|~) into the ket |a’); since the norm of the ket should not be affected by 
displacement, the corresponding operator is expected to be unitary, and 
this will be shown by explicit construction. We call this operator U,(@), 
where the subscript denotes a displacement in space and the argument 
is the vector displacement interval [in contrast with U,(r) considered 
below, which is the operator for displacement in time through the 
interval 7]. 
We thus have 


U,(@)|a) = |a’) or U,(o)Walt) = ar(r) (26.2) 


The second of Eqs. (26.2) makes use of the coordinate representation 
of the state vector; together with Eq. (26.1) it becomes 


U,(e)Palt) = alr — @) (26.3) 


In evaluating ¥.(r — 9) it is convenient to choose the coordinate axes 
at first so that the x axis is in the direction of the vector 9. Then a 
Taylor’s series expansion gives 


Volt — 0) = Velt — 9, 4s 2) = Yalts442) — x Valt¥,2) 


p? oe? _ 
ae DI ax? Va(x,Y,2) 


The right side may be written in the form 
ee C9182) (254,2) 


For a general choice of coordinate axes, p(@/dx) may be replaced by 0 - V, 
so that we obtain 


a(t — 0) = exp (—e- V)¥a(t) = exp (=% : 3) va(r) 


where the momentum operator p = —zhV has been introduced. Thus 
if we choose 


U,(e) = exp oP , (26.4) 


Eq. (26.3) is valid for all-state vectors. Further, since the operator V, 
which is defined only in the coordinate representation, has been replaced 
by ip/h, Eq. (26.4) is valid in all representations. It is easily verified 
that U,(e) is unitary, since @ is real and p is hermitian. 
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EQUATION OF MOTION 


A physical system in a state represented by any ket |a(t)) at a particular 
time ¢ can always be displaced in space through the vector @ by making 
use of Eqs. (26.2) and (26.4). However it need not be true that the 
kets |a’(t)) obtained in this way at different times will represent a 
possible motion of the system. In other words, if |a(t)) obeys the 
Schrédinger equation of motion (24.1), it may or may not be true that 
|x’(t)) = U,(g)|a(t)) also obeys this equation. In order to see whether 
or not this is the case, we calculate the time derivative of |a’(¢)): 


it 5 Ja’(d) = #hU,(0) © Ja(t)) = Us(e)Hla()) 


U,(9) HU,*(@)|a’(t)) 
This agrees with Eq. (24.1) if and only if 
U,(o)HU,"(9) = H or [U-(e),H] = 0 (26.5) 


If @ is allowed to be any vector, Eq. (26.5) is valid only if p commutes 
with H. Then, in accordance with the discussion of Eq. (24.4), the 
momentum operator is a constant of the motion. It also follows from 
Sec. 22 that p and H can be diagonalized simultaneously, so that a 
single state can have well-defined eigenvalues for both momentum and 
energy. 

Thus we have shown that a physical system that can be displaced 
in space and still be a possible physical system can be characterized by 
a constant and well-defined value of the momentum, as well as of the 
energy. Such a system is said to possess space-displacement symmetry 
or space-displacement invariance. This result is reasonable, since a 
particle can be space-displacement symmetric or invariant only if no 
force acts on it, in which case its momentum is constant. A free electron 
possesses this symmetry, whereas the electron in a hydrogen atom does 
not. In the latter case, displacement of the wave function yields a wave 
packet that retains its form only momentarily. 


SYMMETRY AND DEGENERACY 


An important aspect of symmetry has to do with its relation to the 
degeneracy of energy eigenvalues. Suppose that a ket |a) represents an 
energy eigenstate of a system, so that H|a) = E,|a), and that there is 
some operator 2 that commutes with H. Then it is easily seen that 
Q|a) is also an eigenstate of H with the same energy eigenvalue E,. Thus, 
if Q|a) is linearly independent of |a), this energy eigenvalue is degenerate. 


1 The subscript S that denotes the Schrédinger picture is omitted here, since only 
this picture will be used throughout this chapter. 
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Since a space-displaced state is generally linearly independent of the 
original state, space-displacement symmetry (for which 2 = p) gives 
rise to energy degeneracy of momentum eigenfunctions. This is not a 
new result, since we know that the energy of a free particle depends only 
on the magnitude, not the direction, of its momentum. An exceptional 
situation arises when the state is constant in space, since then the space- 
displaced state is not linearly independent. Such astate has zero momen- 
tum and is nondegenerate. 

Energy degeneracy associated with a geometrical symmetry such as 
space displacement is usually easy to identify. In Sec. 30 we shall con- 
sider two less obvious examples, of dynamical rather than geometrical 
symmetry, where there is also degeneracy. The problem then is to find 
the operator Q, analogous to p above, that commutes with the hamiltonian 
and hence defines the symmetry. 


MATRIX ELEMENTS FOR DISPLACED STATES 


A matrix element of a dynamical variable 2 can be calculated for any 
pair of states and compared with the matrix element of the same operator 
when the states are displaced. The first matrix element is (a|9|8), and 
the second is 


(a|2|8’) = (a|U,"(9)2U,(e)|8) 


It is apparent that matrix elements of any function of the momentum 
operator are unchanged when the states are displaced. On the other 
hand, if Q is the coordinate operator r, it can be shown that 


U,t(o)rU-(0.) =r +e (26.6) 


Thus the matrix elements of r for the displaced states are equal to 
the corresponding matrix elements of r + @ for the original states, as 
expected. 


THE GROUP CONCEPT 


The branch of mathematics that is appropriate for a full treatment of 
symmetry is the theory of groups. Although we shall make essentially 
no use of the formal aspects of group theory in this book, it is often 
convenient to refer to some of its ideas and terminology. We therefore 
give a few basic definitions here.! 


1 For fuller discussions of symmetry and group theory, see E. P. Wigner, “Group 
Theory” (Academic, New York, 1959), especially chaps. 15, 20, and 26; M. Hamer- 
mesh, “Group Theory and Its Applications to Physical Problems’’ (Addison-Wesley, 
Reading, Mass., 1962), especially chap. 8; A. Messiah, ‘Quantum Mechanics,” 
vol. II, chap. XV and App. D (North Holland Publishing Company, Amsterdam, 
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A set of objects a, b,c, . . . form a group if a process can be defined 
that enables us to combine any two of the objects, such as a and 5, to 
form an object ab, and if the following conditions are satisfied. 


1. All results of combination, such as ab, are members of the group. 

2. The group contains an identity or unit member e that has the proper- 

_ ties ae = ea = a, where a is any member of the group. 

3. Each member a has an inverse a~, also in the group, such that 
aa = aa =e. 

4. Group combination is associative, so that (ab)c = a(bc). 


The members of the group are called elements, and the combination 
process is often called multiplication, even though it may not be related 
to arithmetic or matrix multiplication. For example, the integers 0, 
+1, +2,... form a group if the combination process is ordinary 
arithmetic addition; in this case the identity or unit element e is the 
number 0. 

A group is abelian if multiplication is commutative, so that ab = ba 
for all pairs of elements. A group is continuous if its elements can be 
labeled by one or more continuously varying parameters. It is contin- 
uously connected if a continuous variation of the group parameters leads 
from any element to any other. A continuous group is compact if every 
infinite sequence of elements of the group has a limit element that is also 
in the group. Two groups are said to be isomorphic to each other if 
there is a unique one-to-one correspondence between elements of the two 
groups such that products of corresponding elements correspond to each 
other. If a group is isomorphic to another group, the elements of which 
are matrices, the latter is said to be a matrix representation of the former. 

It is easily established that all displacement vectors @ form an 
abelian, continuously connected, three-parameter, noncompact group, for 
which the multiplication process is vector addition. Also, the operators 
U,(e) form a group that is isomorphic to the group of displacement 
vectors. In this case the multiplication process is successive operation 
and in the case of a matrix representation of the operators is matrix 
multiplication. 


1962); M. Tinkham, ‘Group Theory and Quantum Mechanics” (McGraw-Hill, New 
York, 1964), especially chap. 5; M. L. Goldberger and K. M. Watson, ‘Collision 
Theory,’’ chap. 2 (Wiley, New York, 1964); P. Roman, “Advanced Quantum Theory,” 
chap. 5 and App. 2 (Addison-Wesley, Reading, Mass., 1965); H. J. Lipkin, “Lie 
Groups for Pedestrians’? (North Holland Publishing Company, Amsterdam, 1965); 
R. Hermann, “Lie Groups for Physicists’? (Benjamin, New York, 1966); A. Katz, 
“Classical Mechanics, Quantum Mechanics, Field Theory,” chap. 2 (Academic, 
New York, 1965). 
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TIME DISPLACEMENT 


We consider next the aetasensaes of a state represented by the ket |a(t)) 
in time through the interval 7, which changes it into the ket |e’(t)). In 
analogy with Eq. (26.1), this means that 


|a’(té + 7)) = la(d)) (26.7) 


As in the first of Eqs. (26.2), we define an operator U,(r) that produces 
a generalized rotation of the ket {a(¢)) into the ket |a’(¢)): 


U(r) |a(t)) = |e’ (t)) (26.8) 
Combination of Eqs. (26.7) and (26.8) gives 
U,(r)|a(t)) = |a(t — 7)) (26.9) 


The right side of Eq. (26.9) can be reduced, in analogy with Eq. (26.3), 
to give 


la(t — r)) = e774/44a(t)) (26.10) 


Since our kets are being viewed in the Schrédinger picture, Eq. 
(24.1) enables us to replace (d/dt)|a(t)) by (th)-'Hla(t)). However, 
(d?/dt?)|a(t)) can be replaced by (ih)-?H?|a(t)) only if H is independent 
of the time, and a similar remark applies to the higher time derivatives 
that appear in Eq. (26.10). As in Sec. 24, we assume that H is constant 
and obtain 


U(r) = eitHin (26.11) 


Equation (26.11) is a valid expression for the time-displacement operator 
if it is applied to any ket that represents a physical state, that is, any ket 
that satisfies the Schrédinger equation (24.1). Since U,(r) commutes 
with H, it is easily seen that the time-displaced ket |a’(t)) given by 
Kq. (26.8) also obeys the Schrédinger equation. Thus the assumption 
that the hamiltonian or energy operator is a constant leads to the con- 
clusion that a physical system can be displaced in time and still be a 
physical system. Such a system is said to possess time-displacement 
symmetry or invariance. 

Although intuitively obvious, it is slightly more difficult to show 
that if H depends on the time the system does not possess time-displace- 
ment symmetry. In this case the expression for U,(r) is somewhat more 
complicated than Eq. (26.11). However, it is enough to consider the 
case in which 7 is infinitesimally small and only terms of first order in + 
are retained. Then U;,(r) is given approximately by 1 + irH(i)/h and 
evidently depends on ¢ as well as on. Then, in order to see whether or 
not |a’(é)) satisfies the Schrédinger equation, we calculate its time 
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derivative from Eq. (26.8): 

it F lal () = — 1S lat) + Ue) Hla) 
To first order in 7, the right side is equal to 

a7 FF ha'() + H@la') 


so that |a’(t)) does not obey the Schrédinger equation. In other words, 
if the hamiltonian changes with time, the time-displaced system finds 
itself in an altered environment and hence will not develop properly in 
time. 

In the case in which H is constant, it might at first be thought that 
there is a contradiction between the first of Eqs. (24.3), 


la(t)) = e-*##/4|@(0)) 
and Eq. (26.8), which with the help of (26.11) reads 
la’ (t)) = e*# l(t) 


That they actually agree may be seen by setting ¢ = —7 in the first 
equation and ¢ = 0 in the second, in which ease they become 


la(—r)) = e*#/*|q(0)) and \a’(0)) = e'#/*lq(Q)) 


respectively. Equation (26.7) with t = —7r then shows that the left 
sides are equal. This may be put into words by saying that |e’(t)) has 
the structure at ¢ = 7 that |a(¢)) has at t = 0 and hence may be obtained 
from |a(r)) by letting it develop backward in time from r to 0 in accord- 
ance with Eq. (24.3). 


27ROTATION, ANGULAR MOMENTUM, AND UNITARY GROUPS 


As our next symmetry operation we consider the rotation in space of a 
physical system in a state represented by the ket |a) or the wave function 
¥a(r). We describe a rotation by a linear operator R, which is so defined 
that any vector r is rotated into the vector Rr. The rotation changes 
the ket |a) into the ket |’) or changes the wave function y,(r) into the 
wave function y.(r). In analogy with Eq. (26.1), this means that 


Ya (ir) = ya(r) | (27.1) 


PROPER ROTATION GROUP 


Each rotation operator may conveniently be represented by a 3 X 3 
matrix. We write the rectangular components of an arbitrary vector r 
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‘and the rotated vector re as column matrices, so that the equation 
rz = Rr is written as the matrix equation 


ZR Rew Ry Ru x 
Yr} =| Ryze Ry Ryz 7] (27.2) 
2R Riz Ry Kiss z 


The requirement that the components of rz be real when the components of 
r are real means that the elements of the matrix R are real. Further, 
the requirement that the scalar product of any two vectors be unchanged 
when both vectors are subjected to the same rotation can be used to show 
that R is orthonormal: the three rows of R are orthonormal to each 
other, the three columns of R are orthonormal to each other; also, the 
determinant of R is equal to +1. We shall consider in this section only 
proper rotations, for which the determinant is equal to +1. Since the 
determinant is not zero, the inverse transformation R—! exists and has 
the property r = R-'rp. It can also be shown that R—! is the transpose 
of R: (R-),; = Ry, where i,j = x,y,z. Finally, since there are six inde- 
pendent constraints on the nine matrix elements R,;, all rotations can be 
described by three continuously varying parameters. . 

The matrices R satisfy all the requirements of a group. The com- 
bination process is matrix multiplication, which is associative. Multi- 
plication of two matrices with the above properties leads to a third with 
the same properties. The matrix 6,; is the identity element, and each R 
has an inverse. An example is easily devised which shows that rotations 
do not in general commute, so that the group is not abelian. Thus the 
matrices R form a continuously connected, three-parameter group that 
is easily seen to be compact. This group is designated O(3), the ortho- 
' gonal group in three dimensions, which is the set of all 3 X 3 real 
orthonormal matrices with determinant equal to +1. 

A continuously connected group in which the parameters of the 
product of two elements are continuous, differentiable functions of the 
parameters of the elements is called a Lie group. The space- and time- 
displacement groups are noncompact Lie groups, and the rotation group 
is a compact Lie group. The Lorentz group has three parameters to 
describe rotations and three more parameters to describe velocity changes. 
It is a six-parameter Lie group that is, however, noncompact, since there 
is no transformation that corresponds to the limiting value c of the veloc- 
ity change in any direction. 


GEOMETRICAL ISOMORPHISM 


Each matrix R corresponds to a physical rotation. Hence it can be 
represented by a vector @ whose direction is the axis about which rotation 
occurs (always assumed to be in the clockwise direction when looking 
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out from the origin along }), and whose magnitude is the angle of rotation 
in radians. The endpoints of the vectors that represent all possible 
rotations fill a sphere of radius 7; diametrically opposite points on the 
surface of this sphere represent the same rotation. The process by which 
two vector endpoints within the sphere are combined to form a third 
point in the sphere is complicated when described in terms of the coordi- 
nates of the first two points but can be determined by putting the three 
points into one-to-one correspondence with two of the R’s and their 
matrix product. Thus such points in a sphere of radius 7 form a group 
that is isomorphic to the rotation group.! Itis, however, more of pictorial 
than of calculational value, since the weight function for the group space 
is not constant. 

This geometrical isomorphism can be used to show that the rotation 
group, although continuously connected, is not simply connected. This 
means that the group parameters may be varied along more than one 
continuous path leading from any element to any other, so that these 
paths are not continuously deformable into each other. In the present 
case there are two distinct paths. One of them goes from element R, to 
element R, entirely within the sphere. The other goes from Ri to the 
surface of the sphere, reappears at the diametrically opposite point, and 
then goes to Ry. A path that makes two jumps between diametrically 
opposite surface points can be reduced to the first kind of path, and so 
on. Thus the rotation group is doubly connected. 


INFINITESIMAL ROTATIONS 


A great deal can be learned about the structure of a Lie group from a 
study of the elements that are infinitesimally close to the identity ele- 
ment. As might be expected, this is much simpler than a study of the 
full group. If the vector 9 is of infinitesimal length and only quantities 
of first order in are retained, the relation rg = Rr may be written 


Ir~r+oxer (27.3) 
Combination of Eqs. (27.2) and (27.3) shows that 
1 —¢: dy 
R=| ¢. 1 — dz (27.4) 
— dy Gz 1 ; 


We now wish to find a transformation Ur(¢) that changes the ket 
|) into the ket |a’) or changes the wave function y,(r) into the wave 
function yYa(r): 


Ur(})¥a(t) = var(r) (27.5) 
1 Wigner, op. cit., pp. 90, 249. 
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Combination of Eqs. (27.1) and (27.5) gives 
Ur($)¥a(t) = Ya(R*r) 
= Walr — > X 1) 
= Palt) — (> X 1) - V(r) 


P . 
= Va(r) =n h (> x1)> pya(r) 
in analogy with the derivation of Eq. (26.4). We may thus put 
Un(o) ~1—Fo-L (27.6) 


where in accordance with Eqs. (14.19) 
L=rxp (27.7) 


is the operator for the angular momentum of the particle about the 
origin. 

The three operators L,, L,, and L, are called the generators of the 
infinitesimal rotations about the three coordinate axes through the angles 
¢z, $,, and ¢,, respectively, in accordance with Eq. (27.6). In similar 
fashion, the infinitesimal forms of Eqs. (26.4) and (26.11) show that p 
and —H are the generators of the infinitesimal displacements in space 
and time, respectively. The fact that the components of p commute 
with each other leads directly to the conclusion that the space-displace- 
ment group is abelian. On the other hand, the components of L do not 
commute with each other and so the rotation group is not abelian. 

The equation of motion of the rotated ket |e’) can be found in exact 
analogy with the discussion preceding Eq. (26.5). The condition that 
the system possess rotational symmetry or invariance is that Ur() 
commute with H for all @ or that the generators Lz, L,, and L, commute 
with H. This last means that the angular momentum L is a constant 
of the motion, as is expected for a system that possesses rotational sym- 
metry. It also means that there is energy degeneracy if L|a) is linearly 
independent of |a). 


SPIN OF A VECTOR PARTICLE 


The fact that the generators of infinitesimal symmetry transformations 
are equal to recognizable dynamical variables in the simple situations 
thus far considered suggests that they be used to define dynamical vari- 
ables in more complicated situations. As an example, we now consider 
a particle possessing internal degrees of freedom that can be related to 
an intrinsic spin angular momentum. The particle is assumed to be 
described by a vector wave function t.(r) instead of by a scalar wave 
function ¥.(r) such as has been considered thus far. Further, we assume 
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that, when the state is rotated, not only does r change into Rr but also 
tz changes into Rua. Thus Eq. (27.1) becomes 


ta (Rr) = Ry.(r) (27.8) 
In similar fashion Eq. (27.5) becomes 
Or($) hat) a ta (1) (27.9) 


where Ur($) is now a tensor operator since it changes the vectorial 
character of .. Combination of Eqs. (27.8) and (27.9) gives to first 
order in } 
Ur(>)ta(r) = Ry.(R-'r) | 
= ta(R-'r) + > X da(R-'r) 
=~ te(r — Xr) + $ X te(r) 


~ alt) — £ (6+ Lele) + 6 X ale) (27.10) 


The tensorial character of Ur($) is most conveniently exhibited by 
writing it as a 3 X 3 matrix that multiplies the vector ta to produce the 
vector tq’, in analogy with Eq. (27.2). The first two terms on the right 
side of Eq. (27.10) are then proportional to the unit matrix. Comparison 
of Eqs. (27.3) and (27.4) shows that the third term may be written in 
the form 


& X dale) = — £ (+ Sele) 


where the components of S are the following 3 X 3 matrices: 


0 0 0 00 1 
Sz = th|0 0 -1 Sy = th 0 0 | 
0 1 0 —1 0 0 
0 —1 0 
S. = h| 1 0 0 (27.11) 
0 0 0 


With this definition of S the infinitesimal rotation transformation is 
Ur(o) ~1—5o-(L+S) (27.12) 


The generators of the infinitesimal rotations are the three com- 
ponents of the vector 


J=L+S (27.13) 


which we identify with the total angular momentum of the particle. The 
orbital angular momentum operator is L, and it acts only on the r depend- 
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ence of t.(r) without affecting its components. The operator S is called 
the spin angular momentum, and it rearranges the components of ta (r) 
without affecting its r dependence. Thus L and S commute with each 
other. It often happens that L and S do not separately commute with 
H, whereas J does. Physically this means that the hamiltonian contains 
terms that couple spin and orbital angular momentum to each other but 
not to the surroundings. 

It was found in Eq. (14.22) that the possible eigenvalues of 
L? = L,?7+ L,? + L,? are (1 + 1)h?, where / is a positive integer or zero 
and is called the orbital angular momentum quantum number. Direct sub- 
stitution from the matrices (27.11) shows that S? = S,? + S,? + S,? is 
equal to 2A? times the unit matrix. This corresponds to / = 1 in the 
orbital case, so that the particle carries spin angular momentum h. This 
conclusion derives from the assumed vectorial transformation property 
of the infinitesimal elements of a Lie group determine the main character- 
istics of the structure of the entire group, since in effect they specify how 
the group elements may be integrated to a finite distance from the identity 
element. 


COMMUTATION RELATIONS FOR THE GENERATORS 


It was remarked above that the components of L do not commute with 
each other, and so the rotation group is not abelian. More generally, 
it can be shown that the commutation relations between the generators 
of the infinitesimal elements of a Lie group determine the main character- 
istics of the structure of the entire group, since in effect they specify how 
the group elements may be integrated to a finite distance from the identity 
element. 

The commutation relations between the components of L are readily 
found from Eq. (27.7) and the quantum conditions (24.26). The results 
are 

[Lz,L,y] = whl, [L,,L,] = tab, [L.,L.| = hl, (27.14) 


Direct substitution from Eqs. (27.11) shows that the components of S 
satisfy the same relations. Then, since L and S commute with each 
other, the components of J also satisfy Eqs. (27.14). These three equa- 
tions are often written in the equivalent form 

JxjJ=ajJ (27.15) 

The relations (27.15) have been established only for the total angular 
momentum of spin zero (scalar) and spin one (vector) particles. Never- 
theless we adopt them as the defining equations for the angular momen- 
tum and for the rotation group generators in the general case. The 
generators of a Lie group form the basis of a Lie algebra; this algebra is 


1 Wigner, op. cit. 
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closed in the sense that the commutator of any pair of generators is a 
linear combination of the generators. Our next task will be to solve the 
algebraic relations (27.15) in order to obtain explicit matrices for the 
J’s and hence for the infinitesimal elements of the group. 

Once the J matrices are known, it is an easy matter to extend the 
infinitesimal rotation operators given by Eqs. (27.12) and (27.13) to 
finite rotations. Suppose we wish to find Ur(¢) for some finite rotation 
. We choose coordinate axes so that one of them, say the x axis, is 
along. Then the effect of a small increase in the magnitude of $, from 
¢ to @ + Ag@, is to follow the finite rotation Ur(@) by the infinitesimal 
rotation 1 — (1/h) A¢Jz, to give 


Ven @ ( 7 j Aes.) Ua(¢) 
Thus Ur(¢) satisfies the differential equation 


OHO = — F709) 
together with the boundary condition Ur(0) = 1. This equation is 
readily integrated to give 

Un(o) = e~ivein 
Since no restriction was involved in the choice of the x axis for 6, we 
obtain 


Ur(o) = ep =#'J (27.16) 


as the operator for finite rotations. 


CHOICE OF A REPRESENTATION 


We now solve Eqs. (27.15) of the Lie algebra in a manner similar to that 
used for the harmonic oscillator in Sec. 25. We have three hermitian 
dynamical variables, J., Jy, and Jz, no one of which commutes with any 
other. Thus not more than one of them can be made diagonal. How- 
ever we can construct J? from them, and it is easily seen that it commutes 
with each component of J. In analogy with the energy representation 
in the oscillator problem, we now choose a representation in which J? and 
one of the components, say J., are diagonal. 

Just as in the oscillator problem it was more convenient to work 
with p + imw,x than with x and p, so it proves to be more convenient 
now to work with J, + 2J, than with J; and Jy. We therefore define 
the nonhermitian operators 


J, =J2t+ wy J_-=J,—ty (27.17) 


which are hermitian adjoints of each other and play a role similar to the 
raising and lowering operators in the oscillator problem. It is evident 
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that Jz commute with J?, and the other commutation relations become 
[JaJ4] = AJ 4 [J.J] = —hJ_ [J.J] = 2hJ, (27.18) 

’ We label the rows and the columns of our representation with a 
pair of numbers j and m. It is convenient to choose the eigenvalues of 
J, to be equal to mh, where m is a set of dimensionless real numbers. 
However, the relation between j and the eigenvalues of J? will be left 
open for the present; these eigenvalues will be written f(j)h?, where f(J) 


is a dimensionless function of j. In our representation, then, the matrices 
for J? and J, are 


(jm|J2| jm") = f(A)? By" Bmme = (mJ. jm!) = mh 437 bmm (27.19) 
Further, since Jz commute with J’, they are diagonal in j although not 
in m. 

The first of Eqs. (27.18) may be written in this representation as 

(jm|J.|g’m' i'm! |J |i’) — Cjm|d4|9m' Gm | |9'm") 

= n(jm|J+|7’"m") 

where the summation symbols for 7’ and m’ have not been written 

explicitly. Both sides of this equation vanish unless f(j) = f( J) =f"). 
We then obtain, with the help of the second of Eqs. (27.19), 

(m — m” — 1)(jm|J4|j7m”) = 0 (27.20) 


Thus (jm|J,|jm”) fails to vanish only when m = m” + 1. This means 
that the result of operating with J, on the ket |jm’’) is some multiple 
of the ket |j, m’’ + 1), so that J, is a raising operator on the eigenvalues 
of J,. In similar fashion, the second of Eqs. (27.18) leads to 


(m — m” + 1)(jm|J_|jm’’) = 0 . (27.21) 


so that J_ is a lowering operator. 
We can thus write the nonvanishing matrix elements of J, and its 
hermitian adjoint J_ in the forms 


(j, m+ US alg) = Ami (jm|J_|\j,m + 1) = Ath (27.22) 


Substitution of the matrix elements (27.22) into the third of Eqs. (27.18) 
shows that the off-diagonal elements vanish on both sides, and a typical 
diagonal element is 


lAm—1|2 — |Am|? = 2m (27.23) 
It should be noted that A, may depend on j as well as on m. 


VALUES OF m, f(j), AND Am 


Equation (27.23) is a first-order linear difference equation in |A,,|?, and 
its general solution has one arbitrary constant: 


[Aml? = C — m(m + 1) (27.24) 
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Now |A»|? is necessarily positive or zero, and yet the right side of (27.24) 
evidently attains negative values for sufficiently large positive and nega- 
tive values of m. This does not cause difficulty if there are two values . 
my, and mz of m for which ),, = 0 and if these two values differ from each 
other by an integer. If this is the case, the series of m values in which 
successive terms differ by unity can terminate at both ends without 
|Am|? becoming negative. Equation (27.20) can be satisfied at the upper 
end (m = m,) by having (j, m: + 1|J4|jm1) = 0 rather than by having 
an eigenvalue of J, greater than mih. Similarly, Eq. (27.21) can be 
satisfied at the lower end (m = mz) by having (jm2|J_|j, m2 + 1) = 0 
rather than by having an eigenvalue of J, less than (m2 + 1)h. Irmnl 2 
is evidently nonnegative for m values that nallge from mz + 1 up to m, 
inclusive. 

We thus have a finite series of values of m ranging from m; down 
to mz + 1 by unit steps, where m; and mz are the larger and smaller 
roots of the quadratic equation C — m(m + 1) = 


m = —%+3(1 + 4C)} Mm, = —}—3(1 + 4C)3 


Since m: + 1 is equal to —mi, the series of m values ranges from m, to 
—m, by unit steps, which means that 2m, must be a positive integer or 
zero. Thus my is restricted to the series of values 0, 3, 1, $, . . . and 
C = mi(m, + 1). . 

In order to evaluate f(7), we calculate the jm diagonal matrix ele- 
ment of J? = $(JiJ_ + J_J) +J,2. The result is 


SD)H? = F[Nnal?h? + S[Am[2A? + meh? 
= Ch? = m(m + 1h? 


where use has been made of Eq. (27.24). We rename m, which is the 
largest value of m for a given series, and call it j, so that the eigenvalues 
of J* are j(7 + 1)h?, where j is zero or a positive integer or half an odd 
integer. For each value of j, there are 27 + 1 values of m that range 
by integer steps from j to —j. The nonvanishing matrix elements of the 
raising and lowering operators are obtained from Eggs. (27.22) and (27.24) 
and may be written in various equivalent ways: 


Gj, m + 1J4ljm) = [77 + 1) — m(m + 1A 
| =[G-—m)G+m-+ 1} 


(j,m — A|J_|jm) = [jG +1) — mm — 1A 
=[G+ mG —m+ IP (27.25) 


J.|jm) = mi\jm) 
Jsljm) = [79 + 1) — m(m + I)]Aly, m + 1) 
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An arbitrary phase factor has been set equal to unity, in conformity with 
the usual convention. 


ANGULAR MOMENTUM MATRICES 


Our choice of a representation in which J? and J, are diagonal has led to 
discrete sequences of values for the corresponding labels 7 and m. The 
infinite matrices thus obtained are most conveniently handled by break- 
ing them up into an infinite set of finite matrices, each of which is char- 
acterized by a particular value of 7 and has 27 + 1 rows and columns. 

For j = 0, J? and the components of J are all represented by null 
matrices of unit rank: [0]. The matrices for the next three values of 7 
are obtained by using Eqs. (27.17) and (27.25): 


j=4: 
0 1 0 -1 
ree et 
mens) eal] 
1 oO —1 0 
= 1 2— 3 
ue “ar ed J m5 ‘| 
j=l 
Ft ee ae a os 
, ir nny y= oa 
V2lo 1 0 210 ¢§ O 
1 oO 0 100 
J,=h10 OO 2= 2710 1 0 
0 oO -1 00.1 
24 (27.26) 
0 V3 O 0 
7, =| V3 0 2 O° 
ili (om | 2 0 V3 
0 0 V3 O 
0 —-iW/3 0 0 
_ 42) 3 0 —2i 0 
Jy = th 0 Qt 0 —in/3 
0 0 inf3 1) 
3 oO oO O 100 0 
0 1 oO oO 0100 
= = 1572 
Je = ah! 9 0 -1 O P= 7) 9 0 1:0 
0 oO oO -8 0001 
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CONNECTION WITH SPHERICAL HARMONICS 


Comparison of the foregoing results with those of Sec. 14 suggests a 
close connection between the angular momentum matrices, for which 
J = lis an integer, and the spherical harmonics Y.,,(6,¢) defined in Eq. 
(14.16). The eigenvalue equations (14.22) and (14.23) are consistent 
with the diagonal forms of the J? and J matrices, respectively. Further, 
the Ls can be expressed as differential operators with the help of Eqs. 
(14.20): 


Ly = Le + ily = he (5, + i cot 02) 
; ; (27.27) 
< eR a eee a ; na 
L_ = L, — iL, = he ( 55 + foot 0x) 


It is then possible, by using the properties of the spherical harmonics 
discussed in Sec. 14, to show that 


L4Vin(6,¢) = [1 + 1) — m(m + 1) PAV im41(8,6) 


L_Yim(8,¢) = [Ul + 1) — m(m — I} AY i m—-1(8,4) (27.28) 


as would be expected from Eqs. (27.25). 

It is apparent that the spherical harmonics can be regarded as 
constituting a unitary transformation from the angular momentum to 
the angular coordinate representation, in accordance with the trans- 
formation theory developed in Sec. 23: 


Yin(8,6) = (0¢|lm) (27.29) 


In using Eq. (27.29), it must be remembered that the weight function 
sin @ is to be included whenever an angular coordinate integration is 
performed. 


SPIN ANGULAR MOMENTUM 


Since the orbital angular momentum L = r x p is expressed in terms of 
the coordinates and momenta, the quantity L? is not in general a con- 
stant of the motion. Thus the quantum number / need not be well 
defined and cannot be used to characterize a particle except in very 
restricted circumstances. It is, however, possible to have an angular 
momentum S that satisfies the fundamental commutation relations 
(27.15) and such that S? commutes with all dynamical variables; a 
necessary condition is that S not be expressed in terms of rand p. Then 
S? is strictly a constant of the motion and can be replaced by the number 
s(s + 1)h?, where s is an integer or half an odd integer. We have already 
seen an example of such an intrinsic spin angular momentum, in which 
a vector wave function was found to haves = 1. As expected, this wave 
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function has 2s + 1 = 3 components, and the spin matrices (27.11) are 
essentially the same as the matrices in (27.26) that correspond to j = 1. 
The second set of matrices may be obtained from the first by a unitary 
transformation that merely has the effect of regrouping the components 
of the vector wave function (see Prob. 7). 

It is found experimentally that electrons, protons, neutrons, neu- 
trinos, and » mesons (muons) each have s = 3, photons have s = 1, and 
m mesons (pions) have s = 0. There are also other shorter-lived particles 
and aggregates of particles that are characterized by definite values of 
the spin angular momentum.! 


COVERING GROUP 


The angular momentum matrices that correspond to half-odd-integer 

values of 7 cannot represent pure orbital angular momenta and so can 

appear in nature only in connection with spin. - A particle or system 
that has j = 4, 3, . . . has the following interesting property. Suppose 
that a state of such a system is rotated through 27 radians. For conven- 
ience we choose the axis of rotation to be the z axis and calculate the 
rotation operator from Eq. (27.16): 


Un(2Qm) = e72¥tJain 


Since the J, matrices given in Eqs. (27.26) are diagonal, Up(2r) is also 
diagonal and has eigenvalues e~?**", where m is half an odd integer. Thus 
Up(2r) is equal to —1 times the unit matrix. It is easily seen that this 
result applies to rotation of 27 radians about any axis, since the com- 
ponent of J in that direction can always be chosen to be diagonal. 

We conclude that rotation of a half-odd-integer spin particle through 
2m radians changes the sign of its state function. An integer spin parti- 
cle is unchanged by a 27 rotation, whereas a half-odd-integer spin particle 
requires a 4x rotation to return to itself. Since s = } particles actually 
occur in nature, this result cannot be dismissed as a mathematical 
curiosity. That there is no conflict with experience follows from the fact 
that the wave function itself is not a directly measurable quantity. 
Rather, bilinear combinations of the wave functions appear in the results 
of measurements, and these do not change sign on 27 rotation. 

We see then that half of the matrices that can be used to represent 
J or Ur($) are double-valued with respect to the vectors. This suggests 
that these representations could be made single-valued by distinguishing 
between $’s that differ by a 27 rotation, for example, by enlarging the 


1 See, for example, K. Nishijima, “(Fundamental Particles” (Benjamin, New York, 
1964); S. Gasiorowicz, ‘Elementary Particle Physics,” pt. III (Wiley, New York, 
1966); A. H. Rosenfeld, A. Barbaro-Galtieri, W. J. Podolsky, L. R. Price, P. Soding, 
C. G. Wohl, M. Roos, and W. J. Willis, Rev. Mod. Phys. 39, 1 (1967). 
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sphere of radius 7 that contains the endpoints of these vectors into a 
sphere of radius 27. As before, the weight function for the group space 
is not constant throughout the sphere. This provides a geometrical 
isomorphism to a new group, which has twice as many elements as the 
rotation group. Although the rotation group is doubly connected, the 
new group is seen to be simply connected since the entire surface of the 
sphere of radius 2x corresponds to a single element of the group. This 
new group is called the universal covering group, or simply the covering 
group, of the rotation group. It is said to be homomorphic onto the rota- 
tion group since to each element of the covering group there corresponds 
one and only one element of the rotation group, to each element of the 
rotation group there corresponds at least one element (actually two) of 
the covering group, and there is a correspondence with respect to group 
multiplication.? 


UNITARY AND SPECIAL UNITARY GROUPS IN TWO DIMENSIONS 


The lowest-rank nontrivial representation of the generators of the rota- 
tion group are the first three matrices of Eqs. (27.26), which we write 
in the form 


J = 2h6 
0 1 0 -1 1 0 
me) wee: pall ok se 


The o’s are called the Pauli spin matrices, since they were first discovered 
in connection with electron spin.2, The Ur($) generated in accordance 
with Eqs. (27.16) and (27.30) have two rows and columns and are double- 
valued with respect to the vectors $, so that the Pauli matrices generate 
a representation of the covering group. It is apparent that, since J is 
hermitian, Ur(¢) is unitary. 

It is easily seen that the set of all unitary matrices with two rows and 
columns form a group for which the combination process is matrix multi- 
plication. This is the unitary group in two dimensions, designated U (2). 
However, this group is larger than the group of matrices Ur(}) con- 
structed from (27.16) and (27.30) with all real vectors $, since the 
matrices that appear in the exponent of Ur() have zero trace. Now, 
from Eq. (22.22), 


det (e4) = et - ~ (27.31) 


Since the trace of each component of ¢ is zero, the Ur(p) have unit 
determinant. Thus the covering group is what is called the special 


1 Wigner, op. cit., p. 249. 
2 W. Pauli, Z. Physik 48, 601 (1927); electron spin is discussed more fully in Sec. 41. 
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unitary group or the unitary unimodular group in two dimensions, desig- 
nated SU(2), which is the group of all 2 X 2 unitary matrices with deter- 
minant equal to +1. These matrices do in fact form a group since the 
unimodular property is preserved in matrix multiplication. 


THE GROUPS U(n) AND SU(n) 
A unitary matrix that has n rows and columns can be written in the form 
U = ef (27.32) 


where H is a hermitian matrix that has n rows and columns. All such 
matrices U clearly form a group, designated U(n), for which the combina- 
tion process is matrix multiplication. The diagonal matrix elements of 
H are real, and the off-diagonal elements symmetrically located with 
respect to the principal diagonal are complex conjugates of each other. 
Thus H, and hence also U, is characterized by n? independent parameters. 
The group U(n) is easily seen to be a continuously connected, n?-param- 
eter, compact Lie group. 

The trace of any hermitian matrix is real, and the determinant of 
any unitary matrix is a complex number of unit maemabude, so that in 
accordance with Eqs. (27.31) and (27.32) 


tr (H) =a det (U) = e# a real (27.33) 


Thus if the determinant of U is required to be equal to +1, a single 
constraint is placed on the n? parameters, since then a = 0 (mod 2m). 
These matrices also form a continuously connected, compact Lie group. 
It is designated SU(n) and has n? — 1 parameters. Every member of 
SU(n) is clearly a member of U(n), but the converse is not true. SU (n) 
is therefore said to be a subgroup of U(n). 

We denote a typical member of SU(n) by Uo, where 


Uy = ete tr (Ho) = 0 det (Us) = 1 (27.34) 


Then from a typical member of U(n), characterized by Eqs. (27.32) and 
(27.33), we may construct a corresponding matrix Uo such that 


H = Hy+ = 1 —U = (e!91)U) = Un(et#!1) 


where 1 is the n-dimensional unit matrix, and U» and Hy are related by 
(27.34). The numbers e/" are 1 X 1 unitary matrices that constitute 
the group U(1), and the n X n matrices e**/"1 evidently form an n- 
dimensional matrix representation of U(1). Thus any member of U(n) 
can be written as a matrix product of appropriate members of U(1) and 
SU(n). 
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It is not-difficult to see that U(n) is a subgroup of U(m), and SU(n) 
is a subgroup of SU(m), provided that n < m. Any member U of U(n) 
can be expanded to an m X m unitary matrix by adding rows and 
columns in the following way: 


| 2 : | (27.35) 


Here 1 is the unit matrix with m — n rows and columns, and O is a 
rectangular null matrix. The corresponding hermitian matrix H can 
be expanded in a similar fashion to 


EB S| (27.36) 


where clearly (27.35) and (27.36) are related by Eq. (27.32). Thus a 
particular subset of the matrices of U(m) constitute a matrix representa- 
tion of U(n), and similarly for SU(m) and SU(n). 


GENERATORS OF U(n) AND SU(n) 


The generators of any Lie group are defined in terms of the group 
elements that are infinitesimally close to the unit element, in analogy 
_with Eq. (27.6). Thus, if the group has s parameters, the s generators 
d; specify an infinitesimal element of the group in terms of s infinitesimal 
real parameters ¢;: 


8 
1+7) oa; 
j=1 

For U(n), the generators may be taken to be any n? linearly independent 
hermitian matrices with n rows and columns. Since 7 times the com- 
mutator of any pair of these is also hermitian, it can be expressed as a 
linear combination of the generators. Hence the Lie algebra of U(n) is 
closed. In the same way, the generators of SU(n) may be taken to be 
any n? — 1 linearly independent traceless hermitian matrices with n rows 
and columns. Again, 7 times the commutator of any pair of these is 
also hermitian and traceless, and so the algebra is closed. 

In the case n = 2, the special unitary group has 2? — 1 = 3 param- 
eters, which may be taken to be the three components of the real vector 
. A convenient choice for the three generators is that defined by Eqs. 
(27.30), which is consistent with the commutation relations (27.15). It 
is apparent that no two of the three generators commute with each other. 
The rank of a group is defined to be the maximum number of mutually 
commuting generators, so that SU(2) is of rank 1. 

We found earlier in this section that a particular representation of 
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a Lie algebra, and hence of the group generated by the algebra, is con- 
veniently specified by choosing as many operators to be diagonal as 
possible. Thus we started with the O(3) algebra, defined by the com- 
mutation relations (27.14) or (27.15), and chose a representation in which 
the commuting operators J, and J? are diagonal, as in (27.19). Solution 
of Eqs. (27.15) then led to half-odd-integer as well as to integer values 
of 7 and hence to the recognition that the covering group of O(3) is SU(2). 
In this way the operator J?, constructed from the generators, played an 
essential role in determining the representations. 

According to a theorem of Racah,! the number of independent 
operators that can be constructed from the generators, like J? above, 
and that commute with all the generators of a Lie group, is equal to the 
rank of the group. It was first recognized by Casimir? that one such 
operator can always be formed by taking a suitable bilinear combination 
of the generators; they are therefore called Casimir operators. Since 
SU(2) is of rank 1, its only Casimir operator is J?. 


THE SU(3) GROUP 


The special unitary group in three dimensions has 3? — 1 = 8 generators, 
Ai, . - - » As, Which may be chosen in many convenient ways. Since 
SU(2) is a useful subgroup of SU(3), we specify those three generators 
of SU(3) that are also generators of SU(2) by expanding the Pauli 
matrices (27.30) from two to three rows and columns in accordance with 
(27.36). 


010 0 -1 0 1 0 0 
1 =11 0 0 M=]2 O 0 %2=10 -1 0 
000 0 0 0 0 0 O 
(27.37) 


These first three of the eight \’s evidently satisfy the same commutation 
relations as the o’s; the relations may be written 


[o;,0;] = 21) €ijkOk 2; wp k= %, Y;, 2 or 1, 2, 3 (27.38) 
E 
where ¢; is the fully antisymmetric Kronecker symbol whose only non- 
vanishing values are 
€123 = €231 = €312 = —€132 = —€321 = —eéo13 = 1 (27.39) 


1G. Racah, Rend. Lincei 8, 108 (1950); “Group Theory and Spectroscopy,’? CERN 
preprint 61-8, 1961 (reprint of 1951 lectures at Institute for Advanced Study, Prince- 
ton, N.J.). : 


> H. B. G. Casimir, Proc. Roy. Soc. Amsterdam 34, 844 (1931). 
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The remaining five \’s may still be chosen in many ways. However, 
since the principal application of the SU(3) algebra is to the classifica- 
tion of elementary particles, we follow the notation adopted in that 
connection:! 


00 1 00 -7 0 0 0 
u=]0 0 0 s=!10 0 O A =}O O 1 
100 z 0 O 01 0 
(27.40) 
00 O 1 |! 0 0 
oz O V3l0 0 -2 
It is easily verified that the analog of Eq. (27.38) is 
[Ars] = 22 > figkNe i,j k=1,...,8 (27.41) 
k 


where fix is fully antisymmetric, so that it changes sign when any two 
of its indices are interchanged. The only nonvanishing values of fi; are 
permutations of the following: 


fizs =] Siar = fies = Sess = fosz = Sass = fate = $ (27.42) 
fuss =Sors = $ V3 


Thus fies is equal to the e123 given by (27.39), as expected. 

There are at most two of the eight generators that commute with 
each other, \, and one of the first three \’s in the present notation, so 
that SU(3) is of rank 2. Thus there are two Casimir operators that 
commute with all the \’s. One of these is : 

8 
C= Vx? (27.43) 

i=1 
and the other is a rather complicated sum of trilinear products of the 
’s.2 It is important to note that the relation [C,A,] = 0 must be estab- 
lished from the operator equations (27.41) to (27.43) and not from the 
particular matrix representation of the d’s given by (27.37) and (27.40). 


REPRESENTATION IN TERMS OF COORDINATES AND MOMENTA 

It was noted earlier that the orbital angular momentum operators (27.7) 
satisfy the commutation relations (27.14) for the generators of O(8) or 
SU(2), provided that the components of r and p satisfy the quantum 


1M. Gell-Mann and Y. Ne’eman, “The Eightfold Way” (Benjamin, New York, 1964). 
2J. J. de Swart, Rev. Mod. Phys. 35, 916 (1963). 
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conditions given. Thus we would expect to be able to identify three 
of the eight generators of SU(3) with the components of L. Once this 
is done, it is reasonable to expect that the other five generators can also 
be represented in terms of coordinates and momenta. Since there are 
five of them, it is natural to attempt to associate them with the five 
components of some quadrupole tensor.! We are led in this way to 
consider the following eight quantities as possible representations of the 
eight generators of SU(3): 


L, = yps — 2py Ly, = zp, — xp. L, = «py — ypz 
Qzy = axy + Bpspy Qye = ayz + Bpyps Q.iz = azt + Bp.p; 
(27.44) 


a 


Q = eg? + y? 2 byes De + py? — 2 
y 2/3 | y az") 2 9/3 (P= : 2p) 


Q1 = 5 (a =?) + £ (pz? — py?) 


where a@ and B are some dimensional real numbers. 

The 28 commutators of these eight operators are readily worked 
out and may then be compared with the commutators of the eight 2’s, 
given in Eqs. (27.41) and (27.42). It might at first be thought reasonable 
to identify L,, L,, and L, with multiples of \1, 2, and dz, respectively, 
and the commutation relations (27.38) admit of this possibility. How- 
ever, it then turns out to be impossible to find linear combinations of the 
d’s that correspond to the five Q’s. The reason for this derives from the 
difference between the way in which SU(2) is a subgroup of SU(3) and 
the way in which O(3) is a subgroup of SU(3).2. The Pauli matrices 
(27.30), which are the generators of SU(2), were expanded from two to 
three rows and columns to yield the first three )’s given in (27.37). On 
the other hand, O(3) is a subgroup of SU(3) for the reason that the 
rotation matrix R, given in Eq. (27.2), is a 3 X 3 real orthonormal 
matrix with determinant equal to unity and hence a special case of a 
3X 3 unitary unimodular matrix. It is clearly impossible to find a 
representation of the typical infinitesimal rotation (27.4) that is of the 
form (27.37), since the third rows and third columns of the latter all 
consist of zeros. This conclusion is not surprising when it is remembered 
that A1, Az, As generate the 7 = } representation of SU(2), whereas only 
integer j representations can be expressed in terms of coordinates and 
momenta. 


1J. P. Elliott, Proc. Roy. Soc. (London) A245, 128 (1958). 


> H. J. Lipkin, “Lie Groups for Pedestrians,” sec. 4.2 (North Holland Publishing 
Company, Amsterdam, 1965). 
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The proper identification is obtained by writing (27.4) in a form 
like (27.6): 


Re1- 7 (o-L) & infinitesimal 


It then follows from comparison with (27.37) and (27.40) that 
L, = hr7 L, = —hir; Lz = hrs (27.45) 


The remaining identifications are obtained by comparing the commuta- 
tion relations of the L’s and Q’s with those of the X’s: 


Qn =hVaBbr Qyz = h V aB Xe Qe2 = hVaB ru 
Qo = h VaB ds Qi =hVaB 3 (27.46) 
The Casimir operator (27.43) is, in this representation, 


C = h--L? + (h?aB)—!(Qzy? + Oy + Q.27 + Qo? + Q:”) 


= —3 + (3h!) |(3) r+ (2 p*] @7.a7) 


We return in Sec. 30 to this coordinate-momentum representation 
of SU(3), in connection with the three-dimensional isotropic harmonic 
oscillator. 


28LICOMBINATION OF ANGULAR MOMENTUM STATES 
AND TENSOR OPERATORS 


A central problem of the old quantum theory was the combination of 
the angular momenta associated with two parts of a system (such as the 
orbital angular momenta of two electrons in an atom, or the spin and 
orbital angular momenta of the same electron) to form the angular 
momentum of the whole system. The vector model solved this problem 
with an addition rule: The magnitude of the sum of two angular momen- 
tum vectors can have any value ranging from the sum of their magnitudes 
(parallel case) to the difference of their magnitudes (antiparallel case), 
by integer steps. This is called the triangle rule, since the magnitudes 
of the two angular momenta and their resultant must form a closed 
triangle (which may have zero area). The vector model also states that 
the sum of the z components of the angular momenta equals that of their 
resultant. 

Both of these rules are valid in quantum mechanics as well. We 
show first that they apply to those linear combinations of products of 
eigenstates of two commuting angular momentum operators that are 
eigenstates of the total angular momentum. The same kind of addition 
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formula is then shown to hold for products of rotation or tensor operators 
and for the state produced when a tensor operator acts on an angular 
momentum eigenstate. 

It is apparent that any number of angular momentum eigenstates 
and tensor operators can be combined by taking them two at a time in 
accordance with the methods developed in this section. However, anal- 
ogous procedures have also been worked out for the direct combination 
of three or more angular momentum eigenstates. ! 


EIGENVALUES OF THE TOTAL ANGULAR MOMENTUM 


We start with two commuting angular momentum operators J: and Jz; 
all components of J: commute with all components of Jz, and Ji and J; 
separately satisfy the commutation relations (27.15). The orthonormal — 
eigenstates of J:? and Ji, are |j1m1), and J2 has no effect on them. Simi- 
larly, |jzm2) are orthonormal eigenstates of J2? and J2,, and Ji has no 
effect on them. This representation is specified by the orthonormal set 
of kets |jymij2m2), each of which is a product of the kets |j1m) and 
|j2m2); any other parameters that may be needed to specify the kets can 
be ignored in what follows. 

Since J; and Jz commute, the total angular momentum J = Ji.+ Jz 
also satisfies Eq. (27.15). The orthonormal eigenstates of J? and J, are 
|jm) and specify a second representation. We wish to find the unitary 
transformation (rotation of axes in Hilbert space) that changes from one 
of these representations to the other. It is, however, unnecessary to 
deal with the entire infinite-dimensional Hilbert space at once. Instead, 
we consider the subspace for which j; and jz have definite values; its 
dimensionality is (271 + 1)(272 +1). With this restriction, the kets of 
the first representation may be denoted simply |mimz2), and our object 
is to find the unitary transformation (mym.|jm). Then in analogy with 
the first of Eqs. (23.4’), we have 


|jm) = |mim2Xmim2| jm) (28.1) 


where summations over m; from —j; to j; and over mz from —j:2 to je 
are implied. 

Since J, = Ji, + Jz, it is apparent that (mim:|jm) is zero unless 
m= m+ me. This immediately gives the second vector-model rule 
mentioned at the beginning of this section. It also follows that the 
largest value of m is j: + j2 and that this value occurs only once, when 
m, = j, and mz = j2. This shows that the largest value of 7 is 71 + je 
and that there is only one such state. The next largest value of m is 
ji + je — 1, and this oceurs twice: when m = ji and m2 = j2 — 1, and 
1 See, for example, A. R. Edmonds, ‘“‘Angular Momentum in Quantum Mechanics,” 
chap. 6 (Princeton, Princeton, N.J., 1957). 
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when m, = j1 — 1 and m2 = j2 (provided that neither 7; nor je is zero). 
One of the two linearly independent combinations of these two states 
must be associated with the new state for which 7 = j: + jz, since for 
that j value there must be values of m ranging from j; + j2 to —j1 — je 
by integer steps. The other combination cannot be associated with this 
or a larger j, since the larger m values that would then also be present 
actually are not. Therefore the second combination is associated with 
J =j1 +j2 — 1, and there is only one state with this 7 value. By an 
extension of this argument we can see that each 7 value, ranging from 
ji + je to [71 — jo| by integer steps, appears just once. This establishes 
the triangle rule of the vector model. . 

Each j value of the new representation has associated with it 27 + 1 
linearly independent combinations of the original eigenstates. Thus the 
number of |jm) eigenstates is 

Jitda 
yY +1) 
= 71 —Jal 


which is equal to (27: + 1)(2j2 + 1), as expected. 


CLEBSCH-GORDAN COEFFICIENTS 

The elements of the unitary matrix (mim:|jm) are the coefficients of the 
expansion of the eigenstates |jm) in terms of the eigenstates |mimz), in 
accordance with Eq. (28.1). They are called Clebsch-Gordan, Wigner, or 
vector-coupling coefficients. The inverse expansion to (28.1) is 


|[mim2) = |jm)(jm\mime) 


where summations over m from —j to 7 and over j from |j: — je| to 
j1 + jo are implied. The unitary character of the transformation matrix 
is expressed through the analogs of Eqs. (23.9’) and (23.29): 


(myma|jm)(jm|mym2) = (mima|myms) 
(jm|myma)(mymalj’m’) = (jm\j’m’) (28.2) 
(jm|mim2) = (mim2|jm)* 

The right sides of the first two of Eqs. (28.2) are unit matrices (products 
of Kronecker 6 symbols). We shall follow the usual convention and 
choose the matrix elements to be real, so that the asterisk may be removed 
on the right side of the third of Eqs. (28.2). 

It is possible to obtain explicit formulas for the Clebsch-Gordan 


coefficients.!_ However, they are sufficiently complicated so that it is 
generally simpler either to construct the coefficients as needed in particular 


Edmonds, op. cit., pp. 44-45. 
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cases or to refer to published tables.!. In what follows we first derive 
two recursion relations and then construct some simple cases. 


RECURSION RELATIONS 


We apply the angular momentum raising operator J, defined by Eq. 
(27.17) to the left side of Eq. (28.1), and the equal operator Ji, + J24 
to the right side. On making use of the first of Eqs. (27.25) and dividing 
by i, we obtain 


(7G +1) — m(m + 1)]4j, m + 1) 
= {(7idii1 + LD) — mim + DAs + 1, m2) + 
[j2(J2 + 1) — mo(me + 1)]\mi, me + 1)}(mim2|jm) 


We can now substitute for |j, m + 1) on the left side from Eq. (28.1). 
On the right side we note that the summation over m,; may be replaced 
by a summation over m, = m, + 1 in the first term; it is easily seen that 
the summation range of mj can also be taken to be —j; toj:. Similarly, 
the summation over mz may be replaced by the same summation over 
me + 1 in the second term on the right side. Since the kets |mim.) are 
orthonormal, we can then equate their coefficients on the two sides to 
obtain 


(GG + 1) — mm + 1)]Kmimi|j, m + 1) 
= (agi + LZ) — mil, — 1)]Kmi — 1, mal jm) 
+ [jo(j2 + 1) — mo(me — 1)]Kimi, m2. — 1\jm) (28.3) 
This procedure may be repeated, starting with J_ instead of J,. 
The result is 
[79 + 1) — m(m — 1)]Kmim,|j, m — 1) 
= [fiji + 1) — mlm, + 1)]Kmi + 1, maljm) 
+ [jeje + 1) — mo(me + 1)]mi, me + Aljm) (28.4) 


CONSTRUCTION PROCEDURE 


The matrix (mym2|jm) has (271: + 1)(2j72 + 1) rows and columns but 
breaks up into disconnected submatrices in accordance with the value 
of m = mi+m:.. Thus there will be a 1 X 1 submatrix for which 
m=jitjeandj =j1+ Je. Then there will be a 2 X 2 submatrix for 
which m = j1 + jo — 1 and 7 is either j1 + j2 or j1 + jJ2 — 1. The rank 
1 A list of tables published prior to 1957 is given by Edmonds, op. cit., p. 50; see also 
M. Rotenberg, R. Bivins, N. Metropolis, and J. K. Wooten, Jr., ‘‘The 3-j and 6-7 
Symbols” (Technology Press, Cambridge, Mass., 1959). Rosenfeld et al., op. cit., 


give the matrices for several values of j; and jz in condensed form on a detachable 
wallet card. 
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of these submatrices at first increases by unity from one to the next until 
a maximum rank is reached and maintained for one or more submatrices; 
thereafter it decreases by unity until the last 1 X 1 submatrix has 
m= —jr — je and j = ji+j2 Each of these submatrices is unitary, 
so that the first 1 X 1 submatrix is a number of unit magnitude, which 
we choose by convention to be +1: 


(jiJeljs + Ja ja + Jo) = 1 (28.5) 


We next use Eq. (28.4) with m: = ji, m2 = jo — 1,7 = ji + Jo, and 
m=jit je The first term on the right side is zero, and we obtain 
a ; oe ; jo \t 
(jt, J2 — Agr + Je, jr + Je — 1) (5 2) (28.6) 
when use is made of (28.5). Similar use of Eq. (28.4) with m = 7; — 1, 
Me = joj =jitje, and m = ji + je gives 
; ve ao oe ; a \t 
—] —-1)={- : 28.7 
(ji — 1, gala + Je, jr +52 — 1) (;, i) (28.7) 
Equations (28.6) and (28.7) give the half of the 2 X 2 submatrix 
of (mim2|jm) for which m = j1 + jo — 1 and 7 = 71+ je. The other 
half, for which m = ji + j2 —1 and j = ji +J2— 1, is obtained by 
making use of the second orthonormality relation (28.2). There is, 
however, an arbitrary multiplying factor of unit magnitude, which we 
choose so that 


; } 
(j1, J2 — Uji +e — 1,91 + j2 - 1) = (; J :) 


(28.8) 


; 4 
(ja = 1, jalju + Je => 1, j1 + ja a 1) i (5-45) 


2 


The convention here is that the first matrix element, which has the form 
(j1,J — jilj7), is real and positive. 

The next submatrix may be obtained in similar fashion. Sub- 
stitution of Eqs. (28.6) and (28.7) into the right side of (28.4) gives 


(ji, J2 — 2|j1 + ja ji + J2 — 2) 
_ | j2(2j2 — 1) | 
(j1 + Je) (271 + 272. — 1) 
(j1 — 1, f2 — Uj1 +52, ji: +92 — 2) 
: 4jije , (28.9) 
7 lor + ja) (2j1 + 252 — 1 5 | 
(j1 — 2, Joli + Je Ji + J2 — 2) 
if ji(2j1 — 1) | 
(1 + je) (271 + 272 — 1) 
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In the same way, Eqs. (28.4) and (28.8) lead to 
(ju, J2 — 2[gi + je — 1, fj: +h2 — 2) 


-| ja(2j2 — 1) | 

(a1 + 92) (ji + Je — 1) 

ji 1, je -— Ag + je — 1,91 + je — 2) 

(ja 2 ja + Je ite Rd psa) 


J 
~ (Gi + 32) + 32 — DI 
(hi — 2, jalja + je — Uji tje — 2) 
oe j2(2j1 — 1) | 
(j1 + J2)(J1 +72 — 1) 

We again use the second of Eqs. (28.2) to obtain a normalized set 
of coefficients that are orthogonal to (28.9) and (28.10). As with Eqs. 
(28.8), we choose the arbitrary phase factor so that the first of the follow- 
ing coefficients is real and positive: . 

(jy, Je — 2\ji + Je — 2, j1 + Je — 2) 

a jl. — 1) | 

. Fe + je — 1)(271 + 272 — 1) 
(1 — 1, j2 — Uji + J2 — 2, j1 + Je — 2) 

th (271 — 1I)(2j72 — 1) ; 

= lore Fie DQ + — D | eee 
(ji — 2, jalji +92 — 2, 91 + J2 — 2) 

= lees cee el 

~ LG + pe — DQ + 22 — 1) 

The only difficult part of the foregoing construction procedure is 
the use of orthogonality, which becomes progressively more complicated 
as the rank of the submatrix increases. However, it need be employed 
only once for each submatrix, and it is easier to work out an example 
with particular numerical values for 7; and j2 than the general case just 
considered. It should be noted that the first recursion relation (28.3) 
can also be used to construct the Clebsch-Gordan coefficients if we start 
at the other end with the 1 X 1 submatrix that has m = —j, — je and 
J=ht+je 


SOME PARTICULAR COEFFICIENTS 


The detachable wallet card referred to earlier contains condensed tables 
of Clebsch-Gordan coefficients for the seven cases 

ja: ¢ 118 3 2 «2 

jo) = & 1} «1 Fi 
The first three of these are reproduced below, written in full. Ineach case 


the rows of the matrix are labeled by (mm.) and the columns by a ) 
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It is 


easily verified that Eqs. (28.5) through (28.11) are in agreement with 


All the matrix elements outside the dotted lines are equal to zero. 
these matrices. 


(28.12) 


pare 
ras 
f fy 
[aise 
4--—4 
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Cs ee) en > | 
| | 


rie IN Rk oie 
1} | 


mR rik 


kr miles 


so 
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(28.13) 
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MATRIX ELEMENTS FOR ROTATED STATES 


In Sec. 26, a comparison was made between a matrix element of a dynami- 
cal variable Q, calculated for any pair of states, and the matrix element of 
the same operator when the states are displaced. Equation (26.6) 
showed in particular that the matrix elements of r for states displaced 
through the vector @ are equal to the corresponding matrix elements of 
r + o for the original states. In similar fashion, Prob. 5 shows that the 
matrix elements of r for states rotated through the infinitesimal vector $ 
are equal to the corresponding matrix elements of r+ xr for the 
original states. 

We can thus use the analog of Eq. (26.6) to define an infinitesimally 
rotated operator Qr as 


On = Unt()2Ua(4) ~ 9+ FICO - JM) (28.15) 


where use has been made of Eqs. (27.12) and (27.13). The matrix 
elements of Q for rotated states are equal to the corresponding matrix 
elements of Qe for the original states. It follows that the rotation prop- 
erties of any operator are determined by its commutator with the angular 
momentum. For example, a scalar operator is one that has the same 
matrix elements for rotated states as for the original states, or one for 
which Qe = ©. For such an operator [J,Q] = 0. As another example, a 
vector operator is one for which Eq. (28.15) leads to a relation of the same 
form that (27.3) has for the vector r. Thus Prob. 6 shows that Je ~ 
J+ox/J, so that angular momentum is also a vector operator, as 
expected. 


IRREDUCIBLE TENSOR OPERATORS 


Operators with rotation properties of higher order than scalars and vectors 
can also be constructed. For example, two vector operators A and B can 
be combined to form a nine-component tensor operator A,B;, where 
1,j = 2, y, 2. This is not a convenient form with which to work, any 
more than it is in classical physics; it is customary to use instead the 
scalar product, the vector product or antisymmetric tensor of second rank, 
and the traceless symmetric tensor of second rank, which have one, three, 
and five independent components, respectively. The latter can be shown 
to have the rotation properties of the spherical harmonics Y1,(0,¢) with 
l = 0, 1, and 2, respectively; they are examples of irreducible tensor opera- 
tors, so called because they have the simplest transformation properties.! 

As remarked in connection with Eq. (28.15), the commutator with 
J determines the rotation properties of an operator. In order to relate 


1 For further discussion, see U. Fano and G. Racah, “Irreducible Tensorial Sets” 
(Academic, New York, 1959). 
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these to the spherical harmonics Y1,(0,¢), we operate with J on 
Yim(9,6)f(0,¢), where f(6,$) is an arbitrary function of the angles.! The 
third of Eqs. (14.20) expresses J, as the differential operator —7h(a/d¢), 
so that 


JAY im(6,6)f(0,6)] = $(6,0)J2¥im(9,0) + Yim(0,6)J-f(8,0) 
The first term on the right is mi Yin(0,¢)f(0,¢), so that 


[Jes Vim(O,6)1f (8,6) = mRYim(9,6) f(8,6) 
Since f(6,@) is arbitrary, we obtain the commutation relation 

[J2,Yim] = mh Yim (28.16) 
In similar fashion, Eqs. (27.27) and (27.28) can be used to obtain 


[J4, Vim] = [10 + 1) — mm + DRY mas 


[J_, Y rl = [ll + 1) _ mim —_— 19) 4a (28.17) 


In analogy with the preceding paragraph, we now define an irreduci- 
ble tensor operator T(k), where k = 0,4,1,3, ... , to beaset of 2k 4+ 1 
operators T(k,qg), whereg = k,k — 1, ... , —k, that have commutators 
with J similar to those in Eqs. (28.16) and (28.17), where / or j is replaced 
by k and m is replaced by q: 


[J., T(k,q)] = ghT(k,q) 
(J+, T(k,g)] = [kk + 1) — g(a t+ DPAT(K, g + 1) (28.18) 
[J_, T(k,q)] = (kk + 1) — a(q — DBPATK, gd — 1D 


The importance of these operators derives from the fact that many 
dynamical variables of physical interest can be expressed in this form. 
For example, the 2! electric or magnetic multipole moment operator is an 
irreducible tensor operator with k = l. 


PRODUCT OF TENSOR OPERATORS 
The close similarity between Eqs. (27.25) and (27.28) on the one hand, 
and Eqs. (28.18) on the other, suggests that irreducible tensor operators 
can be combined in accordance with the same rule as angular momentum 
eigenstates. Some such rule is needed, since we have seen that the prod- 
uct of two vector operators is reducible. 

We therefore try to adapt Eq. (28.1) to this purpose. Its analog is 


T (kg) = ST s(kayqs) P2(ho,q2)(qrqalkg) (28.19) 


1 This is the method employed in deriving Eq. (24.34). 
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where the Clebsch-Gordan coefficient (qig2|kq) corresponds to 71 = k, and 
Je = k2; the summation is over qi from —k, to ky and over q2 from —kz to 
k,. As before, we apply J., Ji, and J_ in succession. However, in 
developing recursion relations from (28.1), we could apply J to the left 
side and Ji + J. to the right side. The analogous procedure here derives 
from the identity 


(J,7i:T 2] = [J,7:]T2 + TilJ,T.] (28.20) 
Thus if the commutator with J. is taken with both sides of (28.19) we 
obtain 

[J2, T(k,g)] = Yil:, T1(k1,91)|T2(ke,q2) 

: + Ti(ki,qi)[J2, T2(k2,92)]}(qige|kq) 
= Vga + g2)%T (kag) T'2(keg2)(qig2lkq) 
= ghT(k,q) | 


where use has been made of the first of Eqs. (28.18) applied to 7; and T, 
separately and of the fact that (qig2|kq) is zero unless g = qi + qo. Thus 
T(k,g) as defined by (28.19) also satisfies the first of Eqs. (28.18). 

Next we take the commutator of (28.19) with J,: 


(Js, T(k,g)] = DS {[ka(hr + 1) — (qi + I) PAT (as, gi + 1)T 2(ke,g2) 
+ [ke(ke + 1) — qe(ge + 1)}t4T1(k1,91) T2(%e, g2 + 1)}(qugelkg) 


where use has been made of the second of Eqs. (28.18) applied to 7, and 
T2 separately. Now, just as in deriving Eq. (28.3), the summation over 
qi in the first term may be replaced by a summation over q; = qi + 1, 
with the same range for q, from —k, tok. With similar treatment of the 
second term, we obtain 


[J4, T(k,g)] = | 
DHICAC + 1) — gigi — IP HT1(k1,93) T2(eo,92)(a1 — 1, golkg) + 
[Ka(kee + 1) —go(g2 — 1) T1(ks,91) T2(heo,92)(q1, 42 — 1kg)} 
We may now drop the primes and use the recursion relation (28.3) to 
simplify the right side: 
[J4, T(k,Q)] 
= Sik + 1) — ag + IPATiesg1) T2(ko,92)(gigelk, ¢ + 1) 


Then substitution from (28.19) with q replaced by g + 1 shows that 
T(k,q) satisfies the second of the commutation relations (28.18). In 
similar fashion, the commutator of (28.19) with J_ shows that T (q,k) 
satisfies the third of Eqs. (28.18). 
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We have thus shown that the 2k + 1 operators T'(k,q) defined by 
Eq. (28.19) constitute an irreducible tensor operator. The expansion 
(28.19) can also be inverted by making use of the orthonormality relations 
(28.2) to give the explicit reduction of a product of two irreducible tensors 
if the T(k,q) are known: 


T1(k1,91) To(ko,g2) = YT (k,q)(kalqug2) = YT (ka) (qugalkg) = (28.21) 


The summation here is over q from —k to k, and then over k from |ki — ko| 
to ki + ke; the last expression follows from the third of Eqs. (28.2) since 
the Clebsch-Gordan coefficients are real. 


COMBINATION OF OPERATOR AND EIGENSTATE 


Finally, we can find the analog of Eqs. (28.1) and (28.19) when one of the 
factors on the right is an irreducible tensor operator and the other is an 
angular momentum eigenstate: 


|jm) = YT i(ks,91) | jome)(gume| jm) (28.22) 


The Clebsch-Gordan coefficient (q:m2|jm) corresponds to j1 = ki and je; 
the summation is over gi from —k,; to ki and over m2 from —j2 to je. 
Two matters of notation are worthy of comment in connection with this 
equation. First, we have used subscripts 1 and 2 so that they correspond 
respectively to the first tensor operator on the right side of (28.19) and 
to the second eigenfunction on the right side of (28.1). Second, we have, 
as before, omitted all reference to the parameters other than angular 
momentum quantum numbers which may be needed to specify the states. 
The existence of these other parameters is implied and would in general 
cause the radial dependence, for example, of the ket |jm) to be different 
from that of the ket |j2m2). 
The identity analogous to (28.20) that is needed here is 
JT iljome) = [J,Ti]|J2m2) + TiJ|j2m2) 

The algebra proceeds exactly as in the preceding subsection and leads to 
the conclusion that the set of kets |jm) defined by Eq. (28.22) for a partic- 
ular 7 and all m between —j and j are angular momentum eigenstates 
since they satisfy equations like (27.25) and (27.28). However, they are 
not normalized, since it is not in general true that (jm|jm) = 1. On the 
other hand, the fact that the |jm) satisfy homogeneous equations like 
(27.28) shows that their normalization does not depend on m, although 
it will in general depend on J, and also on ki, j2, and the form of the tensor 
operator T1. 


WIGNER-ECKART THEOREM 


The series expansion of |jm) in terms of T1(k1,q1)|j2m2), given in Eq. 
(28.22), can be inverted by making use of the orthonormality of the 
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Clebsch-Gordan coefficients: 
T1(k1,q1)|Jome) = Slam) jm|qums) = dam )quma| jm) (28.23) 


The summations are over m from —j to j and over j from |ki — j2| to 
ki + jo; as with Eq. (28.21), use has been made of the third of Eqs. (28.2) 
and the reality of the Clebsch-Gordan coefficients to arrive at the last 
expression. 

We may now multiply Eq. (28.23) through from the left with a 
bra (j’m'|, which may depend on the other parameters in a different way 
than |jm). On making use of the orthogonality relation 


(j'm'|jm) = 0 unless j= 7’ and m = m’ 
we obtain 
(j’m'|T 1(k1,91)|J2me) = N(j’,k1,J2,T1)(qume|j’m’) 


N = (j’m'|j'm’) is the normalization factor mentioned at the end of the 
preceding subsection, which is independent of m’. We drop the primes 
and write N in a form that agrees with the standard convention:! 


(jm|T 1(ki,q1)|jamme) = (—1)h4t5(27 + 1)-H || Tae) || 92) que jm) 
7 (28.24) 


Equation (28.24) embodies the Wigner-Eckart theorem.? It states 
that the matrix element of an irreducible tensor operator between 
angular momentum eigenstates depends on the three m values (qi, me, 
and m) only through the Clebsch-Gordan coefficient (q1mz2|jm) that corre- 
sponds to 7; = k; and je. This is not a surprising result, since these 
three m values determine the orientation of the operator and the two 
states with respect to the coordinate system and hence should influence 
the value of the matrix element only through a geometrical factor (the 
Clebsch-Gordan coefficient). The physical properties of the matrix ele- 
ment are contained in the factor (j||T1(k1)||72), which is called the reduced 
or double-bar matrix element. It depends on the magnitudes of the three 
angular momenta (ki, j2, and 7) associated with the operator and the two 
states, on the form of the tensor operator T:, and on the other parameters 
that are needed to specify the two states. 

This factorization of the matrix elements of tensor operators greatly 
simplifies their calculation. Since k, is an integer for physical operators, 
it is simplest to calculate the reduced matrix element from Eq. (28.24) 
with gq: = 0 and with mz, =m =0 or m: = m = #4 as appropriate. 
Matrix elements for other combinations of qi, m2, and m are then readily 
obtained. 


1 Edmonds, op. cit., p. 75. 
* Wigner, op. cit., p. 245; C. Eckart, Rev. Mod. Phys. 2, 305 (1930). 
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Among the most important consequences of the Wigner-Eckart 
theorem are two conditions under which matrix elements of tensor oper- 
ators necessarily vanish. The appearance of the Clebsch-Gordan coeffi- 
cient shows that the matrix element is zero unless m = q; + m2 and 
unless ki, j2, and j satisfy the triangle rule. As an example, the latter 
condition shows that a system in a state with total angular momentum 
j cannot have a nonvanishing expectation value (diagonal matrix element) 
for the 2! electric or magnetic multipole moment operator unless 1 < 2). 
Thus a particle with spin zero cannot possess a magnetic dipole moment, 
and a particle with spin 4 cannot possess an electric quadrupole moment. 


29LJSPACE INVERSION AND TIME REVERSAL 


Parity has appeared earlier in this book in connection with both the one- 
dimensional (Sec. 9) and three-dimensional (Sec. 14) Schrédinger equa- 
tions. So long as the potential energy is unchanged when the coordinate 
r is replaced by —r, the energy eigenfunctions can be chosen either to 
remain unchanged or to change sign when r changes sign, that is, to have 
either even or odd parity. We start this section by applying the general 
approach of Sec. 26 to the relation between parity and the symmetry 
operation of space inversion (r — —r). 

We then consider the symmetry operation of time reversal. The 
classical equations of motion for particles that move under the influence 
of conservative forces are symmetric when the sense of progression of 
time is reversed, since they involve second time derivatives. It is to 
be expected that the corresponding quantum equations also possess time- 
reversal symmetry. However, the fact that the Schrodinger equation is 
of first order in the time derivative whereas the newtonian equations are 
of second order introduces new and physically interesting features into 
the treatment of the quantum situation.! 


SPACE INVERSION 


The matrix, analogous to R defined in Eq. (27.2), that produces an inver- 
sion is 


—i 0 0 
I= 0 -1 0 
0 0 -—I 
so that Jr = —r. It is apparent that J is real and orthonormal, but its 


determinant is equal to —1 so that it is not a proper rotation. Any 
3 X 3 real orthonormal matrix with determinant equal to —1 can be 


1 For a classical discussion of some aspects of space inversion and time reversal, see 
L. I. Schiff, Physics 1, 209 (1965).’ 
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written as a product of J and a proper rotation. The unit matrix and 
I by themselves form a discrete group with two elements, such that J? =1. 

The inversion of a physical system in a state represented by the 
ket |z) or the wave function y¥.(r) changes it into a state represented 
by |e’) or ¥a(r). The relation between the two states is assumed to be 


va'(It) = wfa(r) (29.1) 


where w is a number to be discussed below. The introduction of this 
number into (29.1) and its absence from the corresponding equations 
(26.1), (26.7), (27.1), and (27.8) are consequences of the discrete nature 
of the inversion group as contrasted with the continuous nature of the 
displacement and rotation groups. Any such number introduced into 
the earlier equations would have to depend continuously on 9, 7, or , as 
the case may be, and approach unity as these parameters approach zero 
since |e’) = |a) in the limit; it can then be shown not to have any physical 
consequences. 


UNITARY INVERSION OPERATOR 
The unitary inversion operator U, is defined by 


Ura) = |’) or Urbalt) = Par(r) (29.2) 


Substitution of (29.1) into the second of Eqs. (29.2), followed by another 
operation with U;, gives . 


Urba(t) = wha(—r) Urpa(t) = walt) (29.3) 


Two inversions bring the coordinate space into itself, so that U;? is 
expected to bring a state into itself. In such a situation, the state eannot 
change its norm, although it may be multiplied by a phase factor of unit 
magnitude. Thus w? is a number of unit magnitude, from which it 
follows that w is also. It is also true that w must be the same for all 
states that can be superposed with each other, that is, for all states that 
can be used to describe the same system of particles. For example, if 


v(t) = Y aape(t) 


where the a, are some set of numerical coefficients, then 


Ur Wy (r) = DY Wa GeWa (r) 


which will, in general, be a different state from ¥(r) unless the w,? are all 
the same. 

We shall assume that w has a definite value for each kind of particle. 
We have already noted near the end of Sec. 27 that a 2x rotation of an 
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integer spin particle leaves its state function unchanged, and we expect 
that this is also true for two space inversions. Thusw? = l,andw = +1. 
For half-odd-integer spin particles, we note from Eq. (28.1) that products 
of pairs of them can be superposed to give states of integer angular 
momentum. Thus we expect that w? for a half-odd-integer spin particle 
can be equal to the possible values of w for an integer spin particle. 
These are +1, so that a half-odd-integer spin particle can havew = +1, 
ay, 


INTRINSIC PARITY 


The experimental determination of w for various particles involves their 
mutual interactions.! In particular, it is found that a 7° meson (neutral 
pion), which has zero spin, disintegrates into two photons, so that its 
intrinsic parity can be found relative to that of the electromagnetic field, 
and corresponds to w = —1; this assumes that the parity of the system 
is conserved during the decay process. The neutral pion is therefore said 
to have odd intrinsic parity, or to be a pseudoscalar particle; this is in 
contrast with a spin zero particle with w = +1 (should such exist), which 
would be called a scalar particle. The intrinsic parity of the r+ mesons 
(charged pions), which also have zero spin, cannot be found relative to 
that of the electromagnetic field, since conservation of electric charge 
prevents their decay into photons. However, charged pions can be 
created or destroyed through interactions of nucleons (protons and 
neutrons), and so their intrinsic parity can be found relative to that 
of the nucleons, again if parity conservation during the interaction is 
assumed. Conventional usage assigns even parity to the nucleons and 
odd parity to the pions. 


INVERTED STATES AND OPERATORS 


The equation of motion of an inverted state can be found in the same 
manner as for a displaced state at the beginning of Sec. 26. As expected 
from analogy with Eq. (26.5), it is found that if a particular ket obeys 
the Schrédinger equation the inverted ket does also, provided that 
[Ur,H] = 0. In this case H and U; can be diagonalized simultaneously, 
so that the energy eigenstates can be chosen to have well-defined parities. 
Further, if |«) and U;|a) are linearly independent, there must be energy 
degeneracy. 

In analogy with the discussion of Eq. (26.6), the matrix elements of 
a dynamical variable Q for inverted states are equal to the corresponding 
matrix elements of U;'QU; for the original states. Since Uy is defined 
1C. N. Yang and J. Tiomno, Phys. Rev. 79, 495 (1950); G. C. Wick, A. S. Wightman, 


and E. P. Wigner, Phys. Rev. 88, 101 (1952); K. Nishijima, ‘‘Fundamental Particles,’’ 
p. 41 (Benjamin, New York, 1964). 
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to be unitary, multiplication of the first of Eqs. (29.3) from the left by 
U;* gives 

alt) = oUrya(—t) or =U rta(t) = wYa(~r) 


We can then evaluate U;trU; by allowing it to operate on an arbitrary 
state ¥.(r): 


UrtrU malt) = Urtrwpa(—1) = wtw(—1)pa(r) 


so that 
UytrU; = —r . (29.4) 
In similar fashion, since p = —7#V and L =r x p, it is easily seen that 
U;tpUr = —p UfrLU; = L (29.5) 


Since U; affects the space coordinates but not the spin, we expect that 
it commutes with S and hence also with J = L + S, so that 


UrSU; =S$ U;tJUr = J (29.6) 


Equations (29.4) to (29.6) are in agreement with what would be expected 
classically from the space-inversion properties of coordinates, momenta, 
and angular momenta. Coordinates and momenta are examples of what 
is called a vector or a polar vector, and an angular momentum is an azzal 
vector or a pseudovector. 


TIME REVERSAL’ 


The reversal in time of a state represented by the ket |a) or the wave 
function ¥. changes it into the ket |a’) or the wave function y. that 
develops in accordance with the opposite sense of progression of time. 
For this new state the signs of all linear and angular momenta are reversed 
but other quantities are unchanged, Time reversal is effected by a time- 
independent operator T such that 


T\a) = |a’) or Ta = Wa (29.7) 


We shall assume in what follows that T is a symmetry operation for 
closed isolated physical systems; at present all experimental evidence 
is consistent with this assumption. This means that if |k) or u, represents 
an eigenstate of the hamiltonian (which is constant in time) with energy 
eigenvalue E,, then T|k) or Tu, also represents an eigenstate with the 
same eigenvalue. 

1 Time reversal was introduced into quantum mechanics by E. P. Wigner, Géttinger 


Nachr. 31, 546 (1932). See also E. P. Wigner, ‘“‘Group Theory,” chap. 26 (Academic, 
New York, 1959). 
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At some particular time, say t = 0, the wave function y, can be 


expanded in terms of the energy eigenfunctions uw, in accordance with 
Eq. (23.22): 


Wa = SxGan ile . (29.8) 


We now consider two pairs of operations that can be performed on Wq 
that are expected to lead to the same physical state. In the first case we 
allow the state to propagate to time ¢ and then reverse it. In the second 
case we reverse it at ¢ = 0 and then allow the reversed state to propagate 
with the opposite sense of progression of time, that is, to time —t. With 
the first pair of operations, propagation to time t takes the wave function 
(29.8) into 


Sedane Eat! Ay, 


If now we tentatively assume that T is a linear operator (which we shall 
immediately see leads to a contradiction), time reversal gives 


SpCare Bet? Tay, (29.9) 


With the second pair of operations, time reversal at t = 0 takes the wave 
function (29.8) into 


SeGar T'Ux 
Since the energy eigenvalue of Tu, is E,, propagation to time —t.then 
gives 

Span etzetl "Tu, : (29. 10) 
It is apparent that, in general, the wave functions (29.9) and (29.10) are 
not multiples of each other, as they must be if they are to represent the 
same physical state. Thus T cannot be a linear operator. 
ANTILINEAR OPERATORS 


The foregoing contradiction is clearly connected somehow with the oper- 
ation of complex conjugation. This is not surprising, since a change in 
the sign of ¢ in the Schrédinger equation for y. changes it into the equation 
for ¥2, provided that H is real and independent of the time. Therefore, 
instead of assuming that 7' is a linear operator with the property 


T (a1 + aoe) = aT + aT yr 
we try the assumption that T has the property 

T(awi + ao) = af TY: + az Tyr (29.11) 
Such an operator is said to be antilinear. With the assumption (29.11) 
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for T, the two states (29.9) and (29.10) both become 
S.0ax eBay, 


and the contradiction disappears. 
Application of T to both sides of the Schrodinger equation 


h 4, We 


“at = Ve 


gives 


ap OCP) 
— th = ae THY. 


Thus if 7 commutes with the hamiltonian, so that | 
[7,H] = 0 (29.12) 


then Ty, satisfies the Schrédinger equation with ¢ replaced by —t#, as 
would be expected. It also follows from (29.12) that Tu, is an energy 
eigenfunction with eigenvalue E;, as assumed above. Thus the condition 
that T is a symmetry operation is that Eq. (29.12) be satisfied. The 
situation here is no different from that with the unitary symmetry oper- 
ations discussed earlier, for example, in connection with Eq. (26.5). 


ANTIUNITARY OPERATORS 


An antilinear operator can be expressed as the product of a linear operator 
and the complex-conjugation operator K, which is defined by 


Ky = ¥* 


for an arbitrary function y. Wenote that K? = 1. An especially useful 
class of antilinear operators consists of those for which the linear operator 
that multiplies K is unitary; such operators are called antiunitary. It is 
apparent that K itself is antilinear and also antiunitary. It is easily 
shown (Prob. 15) that the inner product of two state functions is changed 
into its complex conjugate when the states are operated on with the same 
antiunitary operator. The same argument shows that the norm of a 
state is unchanged since it isreal. It is plausible to expect that the norms 
of states and the absolute magnitudes of the inner products of pairs of 
states are unchanged by time reversal. We therefore assume that T is 
antiunitary and write it in the form 


T = UK . (29.13) 


where U is unitary. 
If H is complex, as with the optical-model potential considered in 
Sec. 20, the condition (29.12) requires that U transform H* into H: 
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UH*Ut = H. There is, in general, no unitary operator that has this 
property, so that the physical system represented by a complex hamil- 
tonian is not time-reversal invariant.! This is to be expected, since the 
absorption of particles by a complex potential specifies a particular sense 
of progression of time, as shown in Eq. (20.1). The situation here is 
roughly analogous to friction in classical particle motion or viscosity in 
classical hydrodynamics, which introduces forces that change sign when 
the velocity changes sign, thus destroying time-reversal invariance. 
Both the complex potential in quantum mechanics and friction or 
viscosity in classical mechanics provide simple phenomenological descrip- 
tions of much more complicated situations and do not of themselves imply 
a lack of time-reversal invariance at a more elementary level. 


T FOR A ZERO SPIN PARTICLE 


We now obtain an explicit expression for T such that for the time- 
reversed state the signs of all linear and angular momenta are reversed but 
other quantities are unchanged. The simplest case is a spin zero particle 
in which a state is represented by a one-component wave function. 
Operation on some ¥_ with the coordinate produces a new state which 
we call ¥g: ry. = Ws. If the sign of r is not to change on time reversal, 
we expect this relation to hold also for the time-reversed states; that is, 
if va = Ty and yy = Tyg, then ra = ve. It then follows that 


rT y. = THe! = Wp’ = TY, = Tra 
Since y. is an arbitrary state, we have that 
rT = Tr (29.14) 


Operation on ¥,. with the momentum produces a new state which we 
call y,: pwa = vy. If now the sign of p is to change on time reversal, 


we expect to have for the time-reversed states py. = —y,. We then 
have 
PT Ya = Pha = —vy = —Ty, = —Tpa 
so that 
pT = —Tp (29.15) 
In similar fashion, since L = r x p, 
LT = —TL (29.16) 
In the coordinate representation, r is a real operator and p = —ihV 


is a pure imaginary operator. The simplest choice for T that satisfies 


1 Note that, in spite of this, the reciprocity theorem (20.16) is valid in this situation. 
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Eqs. (29.14) to (29.16) is then 
U=1 T=K (29.17) 


This conclusion evidently depends on the choice of representation. Thus 
if we use the momentum representation, in which yp. is a function of p 
rather than of r, then p is a real multiplicative operator and r is pure 
imaginary, for example, x = th(0/dp,). In this case Eq. (29.18) is still 
valid, but U is now the operator that replaces p by —p: Uva(p) = va(—p). 
We shall always work in the coordinate representation, since this is most 
familiar and is also convenient. 


T FOR A NONZERO SPIN PARTICLE 
For a particle with spin, we expect in analogy with Eq. (29.16) that 


ST = —TS and JT = —TJ (29.18) 


Since the explicit form of 7 depends on the representation, we must 
choose a specific set of S matrices when s ~ 0, in addition to the earlier 
choice of the coordinate representation. We have already seen (see 
Prob. 7) that the s = 1 spin matrices (27.11) and the 7 = 1 matrices in 
(27.26) differ only by a unitary transformation, so that they correspond 
to different choices of representation. When dealing with spin, we shall 
for definiteness always work in the representation specified by Kgs. 
(27.26). 

It then follows that r, S., and S, are real operators, and p, L, and 
S, are pure imaginary operators. Thus if 7 were equal to K, Eqs. 
(29.14) to (29.16) and the S, equation in (29.18) would be satisfied but 
the S, and S, equations would not be. We must therefore choose U so 
that it commutes with r, p, L, and S, and so that 


S,U = —US, and S,U = —US, (29.19) 


Thus U may be any unitary operator that is a function only of S, and that 
satisfies Eqs. (29.19). 

We know from Eq. (27.16) that the unitary operator exp (—7 - J /h) 
rotates a state through the vector angle @. Also, through Eq. (28.15), 
it transforms an operator into the same operator rotated through 9. In 
similar fashion, the unitary operator exp (—7 + S/h) transforms any oper- 
ator function of S into the same function in which the argument is the 
result of rotating S through $. In particular, e~*"5y/* is a unitary trans- 
formation that rotates S through 7 radians about the y axis and hence 
transforms S, into —S, and S, into —S,. We may therefore put 


T = e**S,/AK (29.20) 
That this 7 has the desired properties is shown explicitly in Prob. 16. 


232 QUANTUM MECHANICS 
For a spin % particle, Eq. (29.20) takes the particularly simple form! 


T = —t0,K where cy = Ee a | (29.21) 


SYSTEMS OF SEVERAL PARTICLES 


When several particles are present, T may be constructed by multiplying 
together the U’s that correspond to each particle and then multiplying 
this product into K: 


T = eritS yh... git Say lAK (29.22) 


The order of factors in the product of exponentials is unimportant since 
each operates only on the state of that particle; thus they all commute 
with each other, and T satisfies the first of Eqs. (29.18) for each S. 

Since each S, is pure imaginary, each exponential in (29.22) is real 
and hence commutes with K. Then, since K? = 1, we see that 


T2 = en ttr8 lh Steet e72trSaylA 


Each of these exponentials is a rotation through 27 radians. According 
to the discussion in Sec. 27, it is equal to +1 for an integer spin particle 
and —1 for a half-odd-integer spin particle. Thus 7? is equal to +1 or 
—1 according as the total number of half-odd-integer spin particles in 
the system is even or odd. 

As remarked below Eq. (29.12), if uw, is an energy eigenfunction, 
Tu, is also an energy eigenfunction with the same eigenvalue. We 
suppose at first that there is no degeneracy. Then 7’, represents the 
same state as uz, so that Tu, = cux, where c is some number, and 


Tu = Tou, = c*Tu, = |cl?ux 


If T? = +1, then |c|? = 1, which is a possible situation. Butif T? = —1, 
there is no number c, which means that there must be some degeneracy. 
In this case, 4, and T’u, are orthogonal, as may be seen in the following 
way. It is shown in Prob. 15 that (Ty1,TW.) = (fo). If we choose 
¥i= Tu, and yo = um, this relation becomes (T?u,Tuz) = (ux, Tux). 
Since 7? = —1, the left side is the negative of the right side, so that 
both sides are zero and Tu, is orthogonal to u,. Thus for every uw, there 
is a distinct Tu, that is degenerate with it, so that the overall degeneracy 
is even. 

For a crystal of low symmetry, each atom is in an unsymmetrical 
environment, and one would not normally expect any degeneracy for the 
states of the electrons in that atom. These electrons are moving in the 


1g, is one of the three Pauli spin matrices defined in Eq. (27.30), which are discussed 
more fully in Sec. 41. 
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static electric field produced by the rest of the crystal, which can be 
represented by a term )) e¢(r,) in the hamiltonian, as in Eq. (24.29), 


2 

where r; is the coordinate of the 7th electron of the atom. Such a term 
evidently does not destroy the time-reversal invariance of the rest of 
the hamiltonian that describes the atom. Then if the number of electrons 
per atom is odd, T? = —1 and there must be at least a twofold degen- 
eracy; this is called Kramers degeneracy.1 The degeneracy is removed 
if the crystal is placed in an external magnetic field H. As we shall 
see later (Sec. 48), the magnetic effects associated with the orbital motion 
and spin of each electron cause interaction terms to appear in the hamil- 
tonian that are proportional to L-H and S-H. Equations (29.16) and 
(29.18) then show that the hamiltonian is no longer time-reversal invar- 
iant. Thus the magnetic field produces a splitting of the energy levels 
if the number of electrons per atom of the crystal is odd. It should 
be noted that this situation is not in contradiction with the assumption, 
stated just below Eq. (29.7), that T is a symmetry operation for closed 
isolated physical systems. An external magnetic field is, by definition, 
imposed from outside the system under consideration; the degeneracy is 
removed because we have assumed implicitly that the currents producing 
the field H are not reversed by T. 


REALITY OF EIGENFUNCTIONS 


Let us consider a system that has no spin or in which the spin is not 
significant. Then U = 1 and T = K in the coordinate representation. 
Further, we suppose that there is an operator 2 that commutes with K 
and that has nondegenerate eigenvalues w,: Qu, = w,v,. Then Ku, repre- 
sents the same state as v,, so that Kv, = vn = cv,, where c is some number. 
If now we write v, as the sum of its real and imaginary parts, v, = Wy + 1%,,; 
where w, and z, are real functions, then 


Wy — tay = C(We + 12,4) or (1 ~ c)w, = 11 + clay 


This means that w, and z, are multiples of each other, so that v, is real 
except for a possible complex multiplying coefficient. In particular, all 
nondegenerate energy eigenfunctions are real in this sense if the system 
is time-reversal invariant. 

The foregoing argument can sometimes be extended to the case in 
which there is degeneracy. An interesting example is provided by a real 
hamiltonian that contains an arbitrary spherically symmetric potential, 
so that 


[H,L?] = 0 [K,H] = 0 [K,L?] = 0 


1H. A. Kramers, Koninkl. Ned. Akad. Wetenschap., Proc. 88, 959 (1930). 
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Kigenfunctions vim labeled by an energy eigenvalue E; and an orbital 
angular momentum quantum number / are still degenerate with respect 
to the magnetic quantum number m. However the operator associated 
with m, L, = —ih(0/d¢@), is pure imaginary and, in accordance with Eq. 
(29.16), does not commute with K. Thus we cannot argue, as was done 
in the preceding paragraph, that Kvzim = CUiim so that vim is real; in 
actuality it is not real in general since it is proportional to the spherical 
harmonic Y,,(0,¢), which is complex for m = 0. 

If, however, we restrict our attention to the case m = 0, we are 
dealing only with eigenfunctions of L, that have the eigenvalue zero. 
It then follows from the relation L,K = —KL, that K times an eigen- 
function of L, is still an eigenfunction of L, with eigenvalue zero. Thus — 
if there is no additional degeneracy, we can be sure that those eigen- 
functions v.10 of H and L? that have m = 0 are all real in the sense used 
above. These are just the eigenfunctions used in the scattering theory 
of Secs. 19 and 20. As expected, the phase shifts 4, are real or complex 
according as the hamiltonian is or is not time-reversal invariant.! 


30LIDYNAMICAL SYMMETRY 


We have seen in Sec. 26 that symmetry and degeneracy are associated 
with each other. For example, a system that possesses space-displace- 
ment symmetry is usually degenerate with respect to the direction of the 
momentum vector p, an exception arising when p = 0. Similarly, a 
system that possesses rotational symmetry is usually degenerate with 
respect to the direction of the angular momentum vector J, that is, with 
respect to the eigenvalue of a particular component such as J,. Again, 
the case J = 0 is exceptional. In the cases of the discrete symmetries 
of space inversion and time reversal, degeneracy is less common, since the 
transformed states are more likely to be the same as the original states. 

It was pointed out in Chap. 4 that the hydrogen atom (Sec. 16) and 
the isotropic harmonic oscillator (Prob. 12) have additional degeneracy 
beyond that associated with rotational symmetry. As remarked there, 
this is to be expected whenever the wave equation can be solved in more 
than one way, in different coordinate systems or in a single coordinate 
system oriented in different ways. From our present point of view, we 
also expect these degeneracies to be associated with some symmetry, 
which evidently is not of the geometrical type considered thus far in this 
chapter. We call such symmetries dynamical, since they arise from par- 
ticular forms of the force law. In the two relatively simple cases con- 
1 For an extension of the methods of this subsection to systems in which spin is sig- 


nificant, see R. G. Sachs, ‘‘Nuclear Theory,’’ App. 3 (Addison-Wesley, Reading, Mass., 
1953). 
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sidered in this section, the existence and general nature of the dynamical 
symmetry can be inferred from the corresponding classical system, in 
much the same way as with geometrical symmetries. This is not possible 
in general; indeed, many situations of physical interest have no classical 
analogs. 


CLASSICAL KEPLER PROBLEM 


The classical hamiltonian for the Kepler problem in relative coordinates 
is 
_P_« 

H= one (30.1) 
where y is the reduced mass and « is a positive quantity. For the hydro- 
gen atom, comparison with Sec. 16 shows that « = Ze?. A particular 
solution of the classical orbit problem is an ellipse with semimajor axis a 
that is equal to half the distance from perihelion P to aphelion A (Fig. 25), 
and with eccentricity e that is equal to (a? — b)!/a, where b is the semi- 
minor axis. 

Since H is independent of the time, the total energy £ is a constant 
of the motion. Also, since H possesses rotational symmetry, the orbital 
angular momentum L = r x pisa constant of the motion. Both of these 
statements are easily established from Eq. (24.22) and require calculation 
of some Poisson brackets. It is not difficult to show that 


E=—- = and L? = pxa(l — e?) (30.2) 


L is evidently an axial vector that is perpendicular to the plane of the orbit. 

The rotational symmetry of H is enough to cause the orbit to lie in 
some plane through O, but it is not enough to require the orbit to be closed. 
A small deviation of the potential energy from the newtonian form 
V(r) = —(k/r) causes the major axis PA of the ellipse to precess slowly, 
so that the orbit is not closed. This suggests that there is some quantity, 
other than H and L, that is a constant of the motion and that can be used 
to characterize the orientation of the major axis in the orbital plane. We 


Fig. 25. Classical Kepler orbit with the 
center of attraction at a focus O of the 
ellipse. 
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thus look for a constant vector M, which we expect to lie along the major 
axis, pointing from O to P or from O to A. 

Such a vector has been known for a long time and is called the Lenz 
vector or the Runge-Lenz vector... We write it in the form 


L 
M=PX"™_*, (30.3) 
Me r 
It is easily seen to be a constant of the motion, to have magnitude xe, and 
to be directed from O to P. The following relations are independent of 
the particular choice of the orbital parameters a and e: 


L-M=0 oe ee (30.4) 


HYDROGEN ATOM 


In order to treat the hydrogen atom, the foregoing quantities must be 
translated into quantum mechanics. This has already been done for r, 
p, and L. For M, we note that p x L is not equal to —L x p, so that 
Eq. (30.3) does. not define a hermitian quantity. We therefore redefine 
M as a symmetric average: 


1 . K 
M= 7, @xL—Lxp)— 7r (30.5) 


It can then be shown from the commutation relations for r and p, after a 
considerable amount of computation, that 


[M,H] = 0 L:-M=M-L=0 


30.6 
Mt = 2 +a) +e ( ) 


These are the quemeumemorhanical analogs of the constancy of M and of 
Eqs. (30.4). 

Equations (30.5) and (30.6) were used by Pauli? to find the energy 
levels of the hydrogen atom, independently of and simultaneously with 
Schrédinger’s® treatment on the basis of the wave equation that was dis- 
cussed in Sec. 16. Pauli’s approach is equivalent to regarding the three 
components of M as generators of some infinitesimal transformations, in 
much the same way that the three components of L were regarded in Sec. 
27 as generators of infinitesimal rotations about the three axes. We thus 
proceed by working out the algebra of the six generators L, M, which con- 
sists of 15 commutation relations. Three of these have already been 


1C, Runge, “Vektoranalysis,” vol. 1, p. 70 (S. Hirzel, Leipzig, 1919); W. Lenz, Z. 
Physik 24, 197 (1924). 

2'W. Pauli, Z. Physik 36, 336 (1926). 

3K. Schrédinger, Ann. Physik 79, 361 (1926). 
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given in Eqs. (27.14) and are cyclic permutations of 


[Lz,Ly] = thL, (30.7) 
Nine more are cyclic permutations of 
[M,,L,] = 0 [M,,Ly] = aM, [M,,L.] = —ithM, (30.8) 


The last three are much more difficult to calculate and are cyclic permu- 
tations of 

[M.,M,] = — a HL, (30.9) 

The L by themselves constitute the closed algebra (30.7) and, as we 
have seen in Sec. 27, generate the group O(3). The L and M together, 
however, do not form a closed algebra since, although Eqs. (30.8) involve 
only L and M, Eq. (30.9) brings in H as well. However, since H is inde- 
pendent of the time and commutes with L and M, we can work in a sub- 
space of the Hilbert space that corresponds to a particular energy eigen- 
value E of the hamiltonian H. Then H may be replaced in Eq. (30.9) by 
E, which for bound states is a negative quantity. It is convenient now to 
replace M by 


mw =(-+) (30.10) 
= aR ; 
The commutation relations (30.8) hold as well for M’, and (30.9) is 
replaced by 
[M,,Mi) = ial, | (30.11) 


THE 0(4) GROUP 


The six generators L, M’ constitute a closed algebra, which can be 
identified with a known algebra in the following way. We relabel the 
coordinate vector r = (x,y,z) and the momentum vector p = (pz,py,Pz) as 


r= (r1,72,73) and P = (p1,p2,s) 

and relabel the angular momentum vector L = (L,,,,L.) accordingly as 
L = (L93,L31,L12) 

We thus have 
Lig = Tspy — Typ [rip] = 1h; (30.12) 


where 2,7 = 1, 2,3. We now extend Eqs. (30.12) to 7,7 = 1, 2, 3, 4 by 
inventing fourth coordinate and momentum components r4 and p, such 
that 

mM. = Du M;, = Lies M; => Liss (30.13) 
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It is easily verified that Eqs. (30.12) and (30.13) lead to the commutation 
relations (30.7), (30.8), and (30.11). 

The six generators L,; obviously constitute the generalization of the 
three generators L from three to four dimensions. The group that they 
generate can be shown to be the proper rotation group or orthogonal 
group in four dimensions, designated O(4), which is the set of all 4 X 4 
real orthonormal matrices with determinant equal to +1. This evidently 
does not represent a geometrical symmetry of the hydrogen atom, since 
the fourth components r, and pz, are fictitious and cannot be identified 
with dynamical variables. For this reason, O(4) is said to describe a 
dynamical symmetry of the hydrogen atom. It does, of course, contain 
the geometrical symmetry O(3) as a subgroup. 

It is important to note that the O(4) generators were obtained by 
restricting our considerations to bound states. For continuum states, E 
is positive, and the sign inside the square root of (30.10) must be changed 
in order for M’ to be hermitian. Then the sign on the right side of Eq. 
(30.11) is changed, and the identifications (30.12) and (30.13) are no 
longer valid. It turns out that the dynamical symmetry group in this 
case is isomorphic to the group of Lorentz transformations in one time and 
three space dimensions, rather than to the group of rotations in four space 
dimensions.! 


ENERGY LEVELS OF HYDROGEN 


The energy eigenvalues can now be found with practically no further 
effort. We define two quantities 


I=3i14M’) K = i(L — M’) (30.14) 
which are easily seen to satisfy the commutation relations 

I,,J,] = thi, ete. K,,K,] = thK, ete. 

[L.,Iy] = 2 etc [ y) = 2 (30.15) 

[I,K] = 0 [1,H] = [K,H] = 0 


Thus I and K each constitute an O(3) or SU(2) algebra, and we see at once 
that the possible eigenvalues are 


P= ita + la? K? = k(k + l)h? 44=0,3,1,... 
(30.16) 


It is easily seen from the commutation relations (30.15) that the O(4) 
group is of rank 2. Thus there are two Casimir operators, which may 
evidently be chosen to be 


P=i(L+M’? and K? = i(L — M’)? 


1V. Fock, Z. Physik 98, 145 (1935); V. Bargmann, Z. Physik 99, 576 (1936); M. 
Bander and C. Itzykson, Rev. Mod. Phys. 38, 330, 346 (1966). 
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Alternatively, they may be chosen to be the sum and difference of I? 

and K?: | 
C=P+ K? = }(L?+ M”) and C’ =I? -—K? =L-M’ 
(30.17) 


The second of Eqs. (30.6) shows that C’ = 0, so that we are dealing 
only with that part of O(4) for which I? = K?. Thus 7 = k, and the 
possible values of the first Casimir operator are 


C = 2k(k + 1)h? k=0,4,1,... (30.18) 
The third of Eqs. (30.6), together with (30.10) and (30.17), then gives 


_l <= ite 2) — _ BR 2 
c=3(u fy M*) = ap ah 


With the expression (30.18) for C, we obtain 


Bie i (30.19) 
2h2(2k + 1)? 
Equation (30.19) agrees with the wave equation result (16.15) if we 
remember that x = Ze? and make the natural identification n = 2k + 1, 
which gives n the sequence of values 1, 2,3, .... 

It is important to note that there is no objection to using half-odd- 
integer values for 7 and k in (30.16). The only physical restriction is that 
L? = I(1 + 1)h? have only integer values of J. But, since L=I1+K 
from (30.14), the triangle rule of Sec. 28 shows that J can have any value 
ranging from 7 + k = 2k = n — 1 down to |i — k| = 0, by integer steps. 
Thus / not only is restricted to integer values but has the correct range of 
values with respect to the total quantum number n. The degeneracy of 
this energy level is also given correctly since J, and K, can each have 
2k + 1 = n independent eigenvalues, and there are therefore n? possible 
states altogether. 

Finally we note that, as discussed in connection with Eq. (29.5), L is 
an axial vector and does not change sign on space inversion. In similar 
fashion, it is apparent that M defined by (30.5) is a polar vector, which 
does change sign. Thus we expect that states defined by the symmetry 
generators L and M need not have well-defined parity. This is actually 
the case, since states of even and odd I are degenerate in the hydrogen 
atom. 


CLASSICAL ISOTROPIC OSCILLATOR 


The three-dimensional isotropic harmonic oscillator is described by the 
hamiltonian 


2 
H = F_+ 3Kr° . (30.20) 
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Fig. 26 Classical harmonic- 
oscillator orbit with the center 
of attraction at the center O of 
the ellipse. 


This is the generalization of the linear harmonic oscillator discussed in 
Sec. 13, for the case in which the force constant is the same in all direc- 
tions. A particular solution of the classical orbit problem is an ellipse 
with semimajor axis a and semiminor axis b, which has its major axis 
oriented so as to make an angle y with the x axis (Fig. 26). As in the 
Kepler problem, H and L are constants of the motion, with values 
given by 


E = 4K(a? + 6?) and L? = mKa’b? 


The fact that the orbit is closed again suggests that there is some 
other constant of the motion that can be used to characterize the orienta- 
tion angle y. There is, however, a striking difference between Figs. 25 
and 26. In the Kepler problem, the center of attraction O is at a focus of 
the ellipse, whereas in the oscillator problem it is at the center. Thus the 
two directions OA and OP along the major axis are not equivalent in the 
Kepler orbit, and the minor axis is not asymmetry element. In contrast, 
both directions along the major axis and both directions along the minor 
axis are equally good symmetry elements in the oscillator orbit. Thus we 
expect that the additional constant of the motion is not a vector, as in the 
Kepler problem, but rather a quadrupole tensor. 

We define the components of the quadrupole tensor as in the last five 
of Eqs. (27.44). Computation of the appropriate Poisson brackets then 
shows that the Q’s are constants if and only if we choose a and 6 such that 
a/8 = mK. For the orbit shown in Fig. 26, the Q’s then have the values 


Qey 3a(a? = b?) sin 27 Que = Q,, = 0 


etd 2 2 
Qo 2/3 (a? + 6?) Q: 
As expected from Fig. 26, the components of the quadrupole tensor are 
unchanged if is replaced by y + 7, and also if a and b are interchanged 
and y is replaced by y + $7. 


4a(a? — b?) cos 2y 
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QUANTUM ISOTROPIC OSCILLATOR 


Since the quantum problem separates in cartesian coordinates, the 
solution is easily found in terms of those given in Sec. 13. The energy 
levels are 


4 
B, = (n+ n(F) N=Nzt nyt nz 
Nz, Ny nz = 0,1,2,... (80.21) 


The degeneracy of E,, is easily seen to be $(n + 1)(n + 2), and the parity 
of this state is even or odd according as n is even or odd. Thus the only 
possible / values are n, n — 2, . . . down to 1 or 0, and it can be shown 
that each / occurs just once. . 

Comparison with the work of Sec. 27 shows that the dynamical 
symmetry group is SU(3).1 Since we require that a/8 = mK, the 
Casimir operator (27.47) is related to the square of the hamiltonian 
(30.20): . 

4m 

C= -3+ anzK Ht” (30.22) 
Substitution of the expression (30.21) for the nth eigenvalue of H into 
(30.22) gives . 


C = $(n? + 3n) 


Since SU(8) is of rank 2, there are two Casimir operators; they can be 
expressed in terms of two parameters, \ and u, which take on the values 
0, 1, 2,.... The general expression for the first Casimir operator 
(27.47) in terms of these parameters is? 


C = $0? + Aw + uw? + 3d + 3y) 


Thus only the representations of SU(3) with (A,u) = (7,0) are realized by 
the isotropic oscillator. The situation here is somewhat analogous to 
that in the hydrogen atom, where only the representations of O(4) with 
2 = k are realized. 

In contrast with the hydrogen atom, we have seen that there is no 
parity mixing in the isotropic oscillator, since the / values in each degen- 
erate state are either all even or all odd. This is to be expected since all 
eight of the generators, the three components of L and the five Q’s, are 
unchanged in sign on space inversion. 

The connection between the isotropic oscillator and SU(3) can also 
be made through the raising and lowering operators, at and a, defined in 
Sec. 25. There are now three of each, one pair associated with each of 
the three coordinates. The commutation relation (25.10) and the 


1J. M. Jauch and E. L. Hill, Phys. Rev. 57, 641 (1940). 
? Lipkin, op. cit., p. 126; his definition of C differs from that used here by a factor 4. 
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hamiltonian (25.11) become 


3 
3 
[a;,aj*] = 65; and H = (y a,ta; + 5) ha 


t= 1 


where 1,j = 2,y,z. Then the algebra of the nine operators a,'a ; is that of 
the generators of U(8); linear combinations of these can be found that are 
equal to H and the eight generators of SU(3).! 


PROBLEMS 


1. Show that Eq. (26.6) is valid, making use of the form (26.4) for U,(@) and the 
commutation relations for the components of r and p. 


2. Make use of the invariance of the scalar product of any two vectors under rota- 
tions, in order to show that the rows and columns of the rotation matrix R are respec- 
tively orthonormal to each other. Show also that the transpose of FR is equal to the 
inverse of R and that the determinant of F is equal to +1. 

3. Show that the three commutation relations (27.14) are valid, making use of the 
form (27.7) for L and the commutation relations for the components of r and p. 

4. Show that the three matrices S defined in Eqs. (27.11) satisfy the relations 
S XS =7hS. Show also that S? = 27. 

5. Show that the matrix elements of r for states that are rotated through the infini- 
tesimal vector @ are equal to the corresponding matrix elements of rz =r + Xr 
for the original states. 

6. Show that the matrix elements of J for states that are rotated through the infini- 
tesimal vector @ are equal to the corresponding matrix elements of Jk =J+oXJ 
for the original states. 

7. Show that the eigenvalues of S, given in (27.11) are the same as those of J, given 
in (27.26) for j = 1. Then find the most general unitary matrix that transforms 
S, into J,: US,Ut = J,. Choose the arbitrary parameters in U so that it also trans- 
forms S, into J, and S, into J,. Into what does this U transform the vector wave 
function Ue of Eq. (27.8)? 

8. Establish Eq. (27.28) by using the definitions (27.27) and the properties of the 
spherical harmonics given in Sec. 14. 


9. Obtain an explicit expression for Ur(@) = exp (—i + J/h) in the form of a 2 X 2 
matrix when J is given by Eq. (27.26) with j = 4. Let the vector @ have the magni- 
tude ¢ and the polar angles @ and y. Show explicitly that your matrix for Ur() is 
unitary and that it is equal to —1 when ¢ = 2r. 

10. Show that the matrices 4; (j = 1, . . . , 8) defined by Eqs. (27.37) and (27.40) 
satisfy the commutation relations (27.41) and (27.42). Then use these commutation 
relations (not the original matrix representation of the d;) to show that each \; com- 
mutes with the Casimir operator C defined by Eq. (27.43). 

11. Show that the 28 commutators of the eight operators (27.44), computed from the 
commutation relations between the components of r and p, agree with the commu- 
tators of the A; when the identifications (27.45) and (27.46) are adopted. 


1 For fuller discussion, see J. P. Elliott, Proc. Roy. Soc. (London) A246, 128 (1958); 
Lipkin, op. cit., chap. 4. 
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12. Use the methods and sORmAUIRS: of Sec. 28 to calculate the matrix of Clebsch- 
Gordan coefficients in the case je = $,j2 = 4. Compare your results with the wallet 
card. a) ite & yoke” ean 
13. A deuteron has spin1. Use the Wigner-Eckart theorem to find the ratios of the 
expectation values of the electric ac moment operator Q(2,0) for one: three 
orientations of the deuteron: m = 1, 0, 


14. Show that the momentum inversion is, U defined after Eq. (29. 17), that 
bas the property Uyva(p) = va(—p), is unitary. 

15. Show that, if Ya’ = Tya and yg = Tyg, then (Ya',¥e) = (Wave)* = (¥6,¥e)- 
From this, show that the norm of a state vector is unchanged by time reversal. 

16. Show explicitly that U = e~‘rSv/* satisfies the two equations (29.19). Make use 
of the commutation properties of S, and the operators S, = S, +7S;. 

17. Show by direct expansion that, for s = 3, e~'™5v/4 = —ig,, 

18. Show by a general argument that T? = +1. Make use of the form 7 = UK 
and of the fact that two successive time reversals take a state into itself so that 7? 
is a multiple of the unit matrix. Do not make use of the particular form for T 
given in Eq. (29.20): 

19. A charged particle with spin operator S is assumed to possess an electric dipole 
moment operator uS, where ,» is a numerical constant, so that the hamiltonian for this 
particle in any electric field E contains the interaction term —y»S-E. Show that 
neither space inversion nor time reversal is a symmetry operation for this particle 
moving in a spherically symmetric electrostatic potential ¢(r), even when no external 
electric field is present. 

20. Find the lowest energy eigenfunction of the hydrogen atom in the coordinate 
representation, starting from the work of Sec. 30. Proceed by finding the analog of 
Eq. (25.8) for the lowest state of the linear oscillator; then solve it in analogy with 
Eq. (25.14). 


8 


Approximation Methods 
for Bound States 


In quantum mechanics, as in classical mechanics, there are relatively few 
systems of physical interest for which the equations of motion can be 
solved exactly. Approximation methods are therefore expected to play 
an important part in virtually all applications of the theory. This 
enhances rather than diminishes the importance of those problems for 
which exact solutions can be found since, as was pointed out at the begin- 
ning of Chaps. 4 and 5, exact solutions are often useful as starting points 
for approximate calculations. They may also help to establish limits of 
validity for various approximation methods. 

In this chapter and the next we develop several approximation 
methods and apply them to illustrative problems. It is convenient to 
divide the methods into two groups according as they deal with bound 
states (considered in this chapter) or with scattering states (considered in 
Chap. 9). 
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31DSTATIONARY PERTURBATION THEORY 


The stationary perturbation theory! is concerned with finding the changes 
in the discrete energy levels and eigenfunctions of a system when a small 
disturbance is applied. It is assumed from the outset that the hamilton- 
ian H in the Schrédinger wave equation can be written as the sum of two 
parts. One of these parts, Ho, is of sufficiently simple structure that its 
Schrédinger equation can be solved, and the other part H’ is small enough 
so that it can be regarded as a perturbation on Ho. It is convenient to 
retain our old symbols wu, and E;, for the supposedly known orthonormal 
eigenfunctions and eigenvalues of the unperturbed hamiltonian Hy and to 
use y and W for the perturbed stationary wave function and energy level: 


Hy = Wy H= Ho + H’ Hou => Eixyux, (31.1) 


NONDEGENERATE CASE 


The assumption that H’ is small suggests that we expand the perturbed 
eigenfunction and eigenvalue as power series in H’. This is most con- 
veniently accomplished in terms of a parameter A, such that the zero, first, 
etc., powers of \ correspond to the zero, first, etc., orders of the perturba- 
tion calculation. We replace H’ by \H’ and express y and W as power 
series in }. We assume that these two series are continuous analytic 
functions of \ for \ between zero and one, although the results obtained 
often have the usefulness of an asymptotic series even when this is not the 
case. -The different orders of the perturbation approximation are then 
given by the coefficients of corresponding powers of A. In the final 
results, A is set equal to 1. 
The perturbed wave function and energy level are written 


YW = Wo + Avi + A2P2 + MMs + 
W = W,.+dAWi14+N2W24+MW3+ °°: 
and are substituted into the wave equation to give 


(Ho + AH’) (Wo + Avi + - * *) 
= (WotdAWit -- ot + +> +) (81.3) 


Since Eq. (31.3) is supposed to be valid for a continuous range of \, we can 
equate the coefficients of equal powers of \ on both sides to obtain a series 
of equations that represent successively higher orders of the perturbation. 


(H 0 — Wo)Wo = 0 

(Ho — Wo) = (Wi — Hho 

(Ho — Wo)¥2 = (Wi — H' i + Wao 

(Ho — Wo)vs = (W, — H')p2 + Wow + Wyo ete. 
1K. Schrédinger, Ann. Physik 80, 437 (1926). 


(31.2) 


(31.4) 
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The first of Eqs. (31.4) means that yo is any one of the unperturbed 
eigenfunctions, as expected. We therefore put 


Yo = Un Wo = En (31.5) 


This state um is discrete, since we are dealing with the perturbation of a 
bound state. It is assumed to be nondegenerate as well, although others 
of the unperturbed eigenfunctions may be degenerate or continuously 
distributed in energy. The case in which the unperturbed state yo is 
degenerate is considered later in this section. 

Two general comments can be made in regard to Eqs. (31.4).1__ First, 
any of the functions y, can have an arbitrary multiple of yo added to it 
without affecting the value of the left side and hence without otherwise 
affecting the determination of ¥, in terms of lower-order functions. We 
choose this arbitrary multiple in such a way that 


(Wows) = 0 s>0 (31.6) 


Second, the inner product of yo and the left side of each of Eqs. (31.4) is 
seen to be zero when use is made of (23.21) or (23.28). Thus the inner 


product of yo and the right side is zero in each case, and it follows with 
the help of Eqs. (31.5) and (31.6) that 


(Yo, H ’We-1) 7 
W, SS pg Ne mld s— 1.7 
ino ee ee enn 
We see that the calculation of W to a given order in H’ requires knowledge 
of y only to the next lower order.’ 


FIRST-ORDER PERTURBATION 
Equation (31.7) with s = 1 shows that 


Wi = (tm,;H'Um) = (m|H’|m) (31.8) 


which is the expectation value of H’ for the unperturbed state m. 
It is convenient to calculate y1 by expanding it in terms of the u,: 


Va = Spd Un (31.9) 


where S, defined at the beginning of Sec. 23, denotes a summation over the 
discrete set together with an integration over the continuous set of eigen- 
functions. Substitution of (31.9) into the second of Eqs. (31.4) gives 


SnAn (Ho = Em)Un = (Wi oe A’ )um 


1P.-O. Léwdin, J. Math. Phys., 6, 1341 (1965). 

2 It can also be shown from Eqs. (31.4) and (31.6) that knowledge of all the y’s up to 
y¥. is sufficient to determine all the W’s up to W241. P.-O. Léwdin, J. Molec. Spectr. 
18, 326 (1964); see also Prob. 14. 
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where a,‘ = 0 because of (31.6). Wereplace Hou, by Enun, multiply by 
uy, and integrate over all space, making use of the orthonormality of 
the w’s: 

(k|H’|m) 


a,? = 
m Ey 


km (31.10) 
SECOND-ORDER PERTURBATION 
Equation (31.7) with s = 2, together with Eqs. (31.9) and (31.10), gives 
S! (m|H’|n)(n|H’|m) 
E,, — En 
_ gy Kenlttny 
" En — E, 


Ws = Sha,(m|H'|n) = 
(31.11) 


where the prime on S,, denotes the omission of the term n = m from the 
summation and integration over n. The last step in (31.11) follows since 
it is assumed that H’ is hermitian. 

We calculate y2 by again expanding in terms of the un: 


= Sdn tn Om = 0 (31.12) 

and substituting into the third of Eqs. (31.4) to obtain 

Sin? (Ho — Em)tn = Span (Wi — H’)un + Wotlm 
As before, we replace Hota by EnUn, multiply by u*, and integrate: 

a.) (Ex — Em) = Wy — Sian (k|H’|n) k#zm 
This gives, with the help of (31.8) and (31.10), 

(k|H'ImXnlH"|m) (| lm)(m| Hm) 

(En — Ex)(Em — Ex) (En — Ex)? 


Equations (31.2) and (31.8) to (31.13), with A = 1, give the energy 
and wave function to second order in H’: 


a, = Si (31.13) 


W = En + (m\H’\m) +S) hele a 
ya Su [UI (ty 
»  {k|H"|n)(n|H’|m) 
+S GEE mp) OMe 


The primes again denote omission of k = morn = m, as the case may be. 


248 QUANTUM MECHANICS 


It is important to note that this y is not normalized, since 


a » (kl H’|m)|? 
WW) =14+ So 
to second order in H’. This occurred because of the arbitrary imposition 
of the condition (31.6). Other choices for (o,y.) would lead to expres- 
sions for y that differ from (31.14) only by a multiplicative complex 
number; they would not affect the expression for W. 


PERTURBATION OF AN OSCILLATOR 


As a simple example of the application of the first- and second-order 
perturbation theory to a nondegenerate state, we consider the perturba- 
tion of the mth energy level of the linear harmonic oscillator of Secs. 13 
and 25 by an additional energy H’ = $bz?. Theunperturbed hamiltonian 
is Hyp = p?/2u + 4Kz? (the mass is denoted by uz to avoid confusion with 
the quantum number m); the unperturbed eigenfunctions u,(z) given by 
Eq. (13.13) correspond to the eigenvalues E,, = (m + 4)h(K/z)', where 
m=0, 1, 2,.... This example is evidently a trivial one since the 
perturbed eigenfunctions and eigenvalues are given simply by replacing 
K by K + bin u(x) and E,,; it is nevertheless instructive. 

We require the matrix elements of x? between various pairs of 
harmonic-oscillator wave functions. They may be obtained with the help 
of the generating function (13.10) for the Hermite polynomials, as in 
Prob. 3 of Chap. 4, or more simply by matrix multiplication using the 
definition (22.3) and the expressions for (n|z|m) given in Eq. (13.18) or in 
Eqs. (25.12) and (25.13). We readily obtain 


(2a?2)-"[(m + 1)(m + 2)]* n=m+2 


\ 


: _ ) a?)-1(2m + 1) n=m 
(n|x?|m) = (2a2)—"[m(m — 1)]* n=m—2 
0 otherwise 


where a = (uK/h?)*. Substitution into the first of Eqs. (31.14) then 
gives for the energy to second order 


K\} b b? 


in agreement with the expansion of (m + 4)h[(K + b)/y]! to second order 
in b. 


DEGENERATE CASE 


The discussion thus far has assumed that the initial state Yo = u» is non- 
degenerate, although others of the unperturbed eigenfunctions may be 
degenerate. Suppose now that there are two states, uv, and uw, that have 
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the same unperturbed energy. Then Eq. (31.10) causes difficulty when 
k = 1 unless it happens that (/|H’|m) = 0. We consider first the case in 
which (i|H’|m) # 0, so that the results obtained above cannot be valid. 

The initial state is not specified by its unperturbed energy; the state 
may be um or uw or any linear combination of them. Let us suppose that 
the perturbation H’ removes the degeneracy in some order, so that for 
finite ) there are two states that have different energies. We assumed 
earlier that y and W gre continuous analytic functions of \ as \ — 0; thus 
each of the two states approaches a definite linear combination of u, and 
u, when \ = 0. If we happen to choose one of these two linear combina-~ 
tions as the initial state, the perturbation expansion can be carried 
through without difficulty. If, however, we start with any other linear 
combination, there is a discontinuous change at \ = 0, so that the expan- 
sions (31.2) are not valid there and the method developed above breaks 
down. Out of the infinite number of orthonormal pairs of linear com- 
binations of up, and w, the particular pair that we require will depend on 
H’. For example, if H’ is the perturbation produced by an external 
electric or magnetic field, the pair we want will depend not only on which 
kind of field is present but also on its direction. 

In order to find this pair, we write in place of Eq. (31.5) 


Yo = AmUm + au W, = En = Fi 


We then substitute this yo into the right side of the second of Eqs. (31.4) 
and take the inner product of this equation successively with u,, and w: 


((m|H'|m) — W1)am + (m|H’|l)a, = 0 
(l|H’|m)an + ({1|H’|l) — Wi)a = 0 


These homogeneous algebraic equations can be solved for a, and a; if and 
only if the determinant of their coefficients vanishes. We thus obtain a 
secular equation of just the type encountered in connection with Eq. 
(22.21); in the present situation it is a quadratic equation for W;. The 
two solutions of this equation are 


Wi = 3((m|H'|m) + (H"|D) 
+ 3[((m|H'|m) — (|H"|D))? + 4|(m|H"|D)7}* (31.16) 


Since the diagonal matrix elements of the hermitian operator H’ are real, 
both values of W, are real. They are equal if and only if 


(m|H’|m) = (|H’|I) and (m|H’|l) = 0 (31.17) 


In this case we say that the degeneracy is not removed in first order; then 
dm and a; evidently cannot be determined from the first-order calculation. 

On the other hand, if either or both of Eqs. (31.17) are not.satisfied, 
the two values of W; calculated from (31.16) are distinct, and each can be 


(31.15) 
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used in turn to calculate a,, and a; from Eqs. (31.15). We thus obtain the 
desired pair of linear combinations of the unperturbed states u, and uj. 
Next we take the inner product of u, with the second of Eqs. (31.4), where 
k ~ m, l, and make use of (31.9) to obtain 


a.) (Ey — Em) = — (klH’|m)an — (k|H" ar (31.18) 


This gives a,“ for k # m, l, and Eq. (31.6) with s = 1 is satisfied if we 
assume that an“ = a) = 0. The calculation can then be carried to 
higher order just as in the nondegenerate situation considered earlier. 


REMOVAL OF DEGENERACY IN SECOND ORDER' 


If the two values of W, obtained in the preceding subsection are the same, 
we must go to second order in order to remove the degeneracy. The inner 
product of the third of Eqs. (31.4) with un and uw yields 


Si(m|H'\n)a,0 — Woon = 0 SiH’ \n)a, — Wear = 0 
(31.19) 


The prime on S,, now denotes the omission of both of the terms n = m and 
n = | from the summation and integration over n, since we again assume 
on the basis of Eq. (31.6) that a,@) = a = 0. Substitution of a, 
from (31.18) into (31.19) gives a pair of homogeneous algebraic equations 
for dy and ay: 


» m|H’|n)|? , EL Veal WD _ 
(SF a ~mi)mt EE - (31.20) 
j (Han El) » |H’|n)|? 
5. . Fe om +(s, ao - W;) a= 0 


The secular equation associated with Eqs. (31.20) yields solutions 
for W, that are of the same general form as (31.16). The analogs of Eqs. 
(31.17) are 


oe a) eee 
— E, — E, 
and Si (m|H'|nXn|H'|l) =Q (31.21) 


En — Ex 


Unless both of these equations are satisfied, the degeneracy is removed in 
second order. 

The close analogy between the sums that appear in Eqs. (31.20) and 
(31.21) and the matrix elements that appear in Eqs. (31.15) and (31.17) 
leads to the former often being referred to as second-order matrix elements. 


1J. H. Van Vleck, Phys. Rev. 33, 467 (1929), Sec. 4. 
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A necessary and sufficient condition that degeneracy is removed in any 
given order is then either that the diagonal matrix elements of H’ for the 
two degenerate unperturbed states are unequal [so that the first of Eqs. 
(31.17) or (81.21) is not valid] or that the off-diagonal matrix element of 
H’ between these states fails to vanish [so that the second of Eqs. (31.17) 
or (31.21) is not valid]. 

So far as the latter situation is concerned, a sufficient condition that 
the degeneracy is removed in first order is that the perturbation H’ con- 
nects the initial degenerate states u,, and w in first order, so that (m|H’|l) 
fails to vanish. Similarly, the degeneracy is removed in second order if 
the perturbation connects the initial states in second order, so that there 
are one or more other states u, such that (m|H’|n) and (I|H’|n) both fail 
to vanish. 

It is not difficult to show that all the foregoing work can be general- 
ized to the removal of degeneracy in higher order and also to the case in 
which the initial state is more than doubly degenerate. 


ZEEMAN EFFECT WITHOUT ELECTRON SPIN 


The change in the energy levels of an atom caused by a uniform external 
magnetic field is called the Zeeman effect. We now consider the change 
of first order in the field strength H for a hydrogen atom. For simplicity, 
the interaction between the magnetic moment associated with the electron 
spin and the magnetic field will be neglected, although in actuality it is 
of the same order of magnitude as the term we calculate. The effect of 
electron spin will be included in Sec. 48. 

A constant magnetic field can be represented by the vector potential 


since H = VxA. The divergence of (31.22) is zero, so that the terms 
involving A that appear in the hamiltonian (24.39) for an electron of 
charge —e and reduced mass yu are 

teh CF ger, e | e? 

Tp Uh gaa > oe p+ 7a (HX 1): (HX 2) 


e é 
Soa ET Be 


ca H?r? sin? @ (31.23) 


where L = r x p, @ is the angle between r and H, and e is a positive 
quantity. Since we wish to work only to first order in H, we can put 


ho 14; 
H’ = 57 H-L (31.24) 


The energy eigenfunctions of the unperturbed hydrogen atom are 
usually chosen to be eigenstates of L,, with eigenvalues mh, where m is 
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the magnetic quantum number introduced in Sec. 14. However, this is 
a sensible choice only if the magnetic field is in the z direction, since then 
Eq. (31.8) becomes 


Wi = (m|H"|m) = aE Hmi (31.25) 


and the degeneracy of the 21 + 1 states of given n and / is removed in 
first order. On the other hand, if the magnetic field is not in the z direc- 
tion, the states labeled by m are not a suitable starting point for the 
degenerate-perturbation calculation. This provides an example of the 
remark at the end of the paragraph above Eqs. (31.15) that the unper- 
turbed initial states must be chosen with the direction of the perturbing 
field in mind. 


FIRST-ORDER STARK EFFECT IN HYDROGEN 


The change in the energy levels of an atom caused by a uniform external 
electric field of strength E is called the Stark effect. Calculation of the 
change of first order in E in the case of a hydrogen atom is somewhat 
more complicated than for a magnetic field. It is still desirable to choose 
the unperturbed initial states with the direction of the perturbing field 
in mind; as before, we take the z axis along the field and use eigenstates 
of L,. However, there is now an additional complication from the fact 
that H’, given by Eq. (31.26) below, is odd with respect to space inversion, 
so that the expectation value of H’ is zero for all states that have definite 
parities. 

The perturbation H’ is now the extra energy of the nucleus and 
electron in the external field and is readily shown to be 


H' = ekz = eEr cos @ (31.26) 


where the polar axis and E are in the direction of positive z and e is again 
a positive quantity. The discussion of Sec. 14 showed that wave func- 
tions for any spherically symmetric potential energy, when expressed in 
spherical harmonics, have even or odd parity according as the azimuthal 
quantum number / is even or odd. Since the perturbation (31.26) is odd 
with respect to inversion, the only matrix elements of H’ that fail to 
vanish are those for unperturbed states that have opposite parities. 
This shows that a nondegenerate state, such as the ground state (n = 1) 
of hydrogen which has even parity, has no first-order Stark effect. 

The first excited state (n = 2) of hydrogen is fourfold degenerate; 
the quantum numbers /, m have the values (0,0), (1,0), (1,1), and (1,—1). 
We now show quite generally that nonvanishing off-diagonal matrix ele- 
ments of H’ exist only for states that have the same quantum number m. 
It is apparent from Eqs. (24.31) that z commutes with the z component 
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of the orbital angular momentum L, = xp, — ypz, so that [L,,H’] = 0. 
The matrix element of this equation between states 7 and k in a repre- 
sentation in which L, is diagonal is (m; — m,)&(j|H'|k) =0, so that 
(j|H'|k) = O unless m; = m,. Thus all 16 matrix elements of H’ between 
the four degenerate unperturbed states given above are zero except for 
the off-diagonal elements between the first two (opposite parity) states. 


PERTURBED ENERGY LEVELS 

An extension of the discussion that led to Eq. (31.15) shows that the 
secular equation for the first-order Stark effect of the n = 2 state of 
hydrogen is 


—W, (0,0\H"|1,0) 0 0 

(1,0|H'10,0) —Wi 0 0 | _ 
‘ ; _w. 9 |=? (31.27) 
0 0 0 —-W, 


The nonvanishing matrix elements of H’ are 


{1,0|H’|0,0) = ° Si 0)* = eEfuz,,(r)r cos 8 usoo(r) dr 


i aa aah jee ‘(2 _ =) e7"!%y? dw dr 


= —BeEag 


The four roots of Eq. (31.27) are 0, 0, 3eEao, and —3eEdao, so that 
half of the fourfold degeneracy is removed in first order. It is easily 
seen that the first two values of W1 correspond to any two linearly inde- 
pendent combinations of ue. and we2,1,-1 and that the third and fourth 
values correspond to 2—?(us00 — we10) and 2-* (we200 + uei0), respectively. 
This means that a hydrogen atom in its first excited state behaves as 
though it has a permanent electric dipole moment of magnitude 3ed» that 
can be oriented in three different ways: one state parallel to the external 
field, one state antiparallel to the field, and two states with zero com- 
ponent along the field. 


OCCURRENCE OF PERMANENT ELECTRIC DIPOLE MOMENTS 


It follows from the foregoing discussion that an atom can have a per- 
manent electric dipole moment (energy change proportional to E) only 
when its unperturbed state is nondegenerate and does not have well- 
defined parity or is degenerate and contains components of opposite 
parities. On the other hand, any atom can have an induced electric 
dipole moment. This means that the dipole moment is proportional to 
E or that the energy of the atom in the external electric field is propor- 
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tional to £2. In accordance with Eq. (31.11), a second-order perturbation 
energy of this type is expected for any system. 

The discussion of Sec. 29 shows that if the unitary space-inversion 
operator U; commutes with the hamiltonian for any system, the energy 
eigenstates can be chosen to have well-defined parities. With the excep- 
tion of the weak interactions, which among other things are responsible 
for radioactive beta decay, all known hamiltonians commute with U;. 
Then it is expected that an atom or nucleus in its ground state will at 
most possess an extremely small permanent electric dipole moment 
(< e X 10-%° em) that might arise from the weak interactions. Thus 
far, none has been found experimentally. 

It is also possible, as in the case of the hydrogen atom, that unper- 
turbed degenerate states of opposite parities can give rise to a per- 
manent electric dipole moment. However, this occurs only in hydrogen 
because of a special dynamical symmetry (see Sec. 30) and even there 
does not occur for the ground state. Such degeneracy could also occur 
accidentally but is then very improbable on statistical grounds. In 
some molecules, however, there is a group of nearly degenerate rotational 
states of both parities. If these energy levels are closely spaced in com- 
parison either with the thermal energy of the molecule or with the energy 
associated with the applied electric field, they can give rise to a permanent 
electric dipole moment.! In this case the direction of the dipole is related 
to a figure axis of the molecule; the dipole would, for example, be along 
the line joining the two atoms in an unsymmetrical molecule such as 
hydrogen chloride. 

In the systems considered thus far, the electric dipole moment is 
either along the applied electric field (hydrogen atom in an excited state) 
or along a molecular figure axis (hydrogen chloride). For an elementary 
particle or a nucleus, it is reasonable to assume that the ground state is 
nondegenerate and that the only figure axis is that defined by the spin 
operator S. Then the electric dipole moment operator must be propor- 
tional to S. This means that, for such a particle moving in a static, 
spherically symmetric electrostatic potential ¢(r), the hamiltonian con- 
tains a term of the form uS + V¢, where uw is a numerical constant. Thus 
even in this environment neither space inversion nor time reversal is a 
symmetry operation (see Prob. 19, Chap. 7). An extension of this argu- 
ment shows that although a particle or nucleus may possess an electric 
charge, magnetic dipole moment, electric quadrupole moment, etc., it 
cannot possess a magnetic monopole, electric dipole moment, magnetic 
quadrupole moment, etc., if either or both of space inversion and time 
reversal are symmetry operations. This supplements the remark at 


1J. H. Van Vleck, ‘‘The Theory of Electric and Magnetic Susceptibilities,”’ p. 154 
(footnote 28), sec. 48, (Oxford, New York, 1932). 
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the end of Sec. 28 which relates allowed multipole moments to the total 
angular momentum quantum number. 


32LITHE VARIATION METHOD 


The variation method can be used for the approximate determination of 
the lowest or ground-state energy level of a system when there is no 
closely related problem that is capable of exact solution, so that the per- 
turbation method is inapplicable. It can also be applied to systems 
that are described by a nonseparable Schrédinger equation, in which 
case numerical solutions are extremely arduous and the WKB method 
(Sec. 34) cannot be used. 


EXPECTATION VALUE OF THE ENERGY 


It was shown in Sec. 10 that, if an arbitrary normalized function y is 
expanded in energy eigenfunctions 


= > Agus where Hug = Eug (32.1) . 
E 


and the uz form a complete orthonormal set, the expectation value of H 
for the function y is given by 


(H) = fy*Hy dr = )) El Az? (32.2) 
E 


where the integration is extended over the entire range of all the coordi- 
nates of the system. It is assumed for convenience in Eqs. (32.1) and 
(32.2) that the energy eigenvalues are all discrete; this can be accom- 
plished by enclosing the system in a box (Sec. 10), or the summation can 
be replaced by the symbol § (Sec. 23). 

A useful inequality can be derived from Eq. (32.2) by replacing 
each eigenvalue E in the summation on the right side by the lowest 
eigenvalue Ko: 


(H) >= )) EolAz|? = Eo |Azl? (32.3) 
E E 


Since »» |Az|? = 1 for a normalized function y, as was shown in Sec. 10, 


E 
(32.3) yields the inequality 


Eo < Sy*Hy dr (32.4) 
In the event that y is not normalized, (32.4) evidently can be rewritten 
as 
* 
Ey < {Y*HY dr (32.5) 


Pe i tne Anil 
~ fll? dr 
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The variation method! consists in evaluating the integrals on the 
right side of (32.4) or (32.5) with a trial function y that depends on a 
number of parameters and varying these parameters until the expectation 
value of the energy is a minimum. The result is an upper limit for the 
ground-state energy of the system, which is likely to be close if the 
form of the trial function resembles that of the eigenfunction (see Prob. 5). 
Thus it is important to make use of any available information or physical 
intuition in choosing the trial function.? 


APPLICATION TO EXCITED STATES 


The variation method can also be used to obtain an upper limit for one 
of the higher energy levels if the trial function is orthogonal to the eigen- 
functions of all the lower states. Suppose that the energy levels are 
arranged in an ascending series: Ko, E;, E2, .... Then, if y is orthog- 
onal to uz, fori = 0, 1, ..., n, it is easily seen from (32.1) that the 
corresponding expansion coefficients Ag, are all zero. An inequality can 
be derived from (32.2) by replacing each eigenvalue E in the summation 
on the right by E41, with the result that the expectation value of the 
energy is an upper limit on this eigenvalue. 

The trial function y — uz,Juz,y dr is evidently orthogonal to uz, so 
that, if the lowest eigenfunction is known either from an exact solution or 


to a sufficiently good approximation from a variation calculation, an . 


upper limit for the energy of the first excited state can be computed. 
Trial functions that are orthogonal to any number of known eigenfunc- 
tions are easily found in this way. 

It is sometimes possible to divide the energy eigenfunctions into 
groups such that any member of one group is orthogonal to any member 
of any other group. Suppose that there is a hermitian operator F that 
commutes with H (FH — HF = 0); then, from a theorem of Sec. 22, F 
and H can be diagonalized simultaneously and have common eigenfunc- 
tions. Now any two eigenfunctions of F that correspond to different 
eigenvalues are orthogonal to each other. Thus a trial function that is 
constructed entirely from eigenfunctions of F that correspond to a given 
eigenvalue is orthogonal to all other eigenfunctions that correspond to 


1 The method was originally applied by Lord Rayleigh in 1873 to the computation of 
the vibration frequencies of mechanical systems: ‘Theory of Sound,”’ 2d rev. ed., vol. 
1, sec. 88 (Macmillan, London, 1937; reprinted by Dover, New York). See also 
W. Ritz, J. Reine Angew. Math. 136, 1 (1908); P. M. Morse and H. Feshbach, ‘‘Meth- 
ods of Theoretical Physics,” sec. 9.4 (McGraw-Hill, New York, 1953). 

2 For an extension of this method that brackets several of the lowest eigenvalues, see 
J. K. L. MacDonald, Phys. Rev. 43, 830 (1933). Lower bounds have been studied by 
P.-O. Léwdin, Phys. Rev. 189, A357 (1965), who also gives references to earlier work. 
3 This is shown explicitly in Eq. (10.4) for the energy operator, and the proof given 
there is easily extended to any hermitian operator. 
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different eigenvalues of F, and will provide an upper limit for the lowest 
energy eigenvalue that is associated with this eigenvalue of F. The fore- 
going results are useful when the operator F is one whose eigenfunctions 
are easily recognizable by some simple property, such as, for example, the 
symmetry in case F is the angular momentum or the parity. Then a 
trial function with angular dependence corresponding to a particular 
angular momentum, or with a particular parity, can easily be written 
and gives an upper limit for the lowest energy level that has this angular 
momentum or parity. 


GROUND STATE OF HELIUM 


As a first example, we use the variation method with a simple trial func- 
tion to obtain an upper limit for the energy of the ground state of 
the helium atom. The helium atom consists of a nucleus of charge +2e 
surrounded by two electrons; from Eq. (16.1) we find that its hamiltonian 
is (neglecting the motion of the nucleus) 

2 2 

H=-— Le (Vi? + Vo?) — 2e? (4 + ~ + = (32.6) 
where r; and rz are the position vectors of the two electrons with respect 
to the nucleus as origin, and ri: = |r1 — r2| is the distance between the 
two electrons. 

If the interaction energy ¢?/r1. between the two electrons were not 
present, the ground-state eigenfunction of H would be the product of 
two normalized hydrogenic wave functions wio0(f1)U100(F2) given in Eq. 
(16.24): 


¥(ri,r2) = SZ eetlediort (32.7) 


with Z = 2. We shall use (32.7) as a trial function and permit Z to be 
the variation parameter so that it is not necessarily equal to 2. 

It follows from Prob. 13, Chap. 4, that the expectation values of the 
kinetic and potential energies for the ground state of a hydrogen atom are 
e?/2do and ~—e?/ao, respectively; the corresponding hydrogen wave func- 
tion is (wao*)-te7/*, The expectation value of either of the kinetic- 
energy operators in (32.6) for the function (32.7) is obtained most easily 
by noting that operation with the laplacian gives a result that is inversely 
proportional to the square of the length scale of the wave function; since 
the scale of (32.7) is smaller than that of the hydrogen wave function by a 
factor of Z, the expectation value of each of the kinetic-energy operators 
is e?Z?/2ao. Similarly, the factors 1/r make the expectation values of the 
nuclear potential-energy operators inversely proportional to the length 
scale; there is also an additional factor 2 from the nuclear charge, so that 
each one is — 2e?Z/ap. 
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ELECTRON INTERACTION ENERGY 
The expectation value of the interaction energy between the electrons is 


oy 
[fv Gute) & ver) din dirs 


3 \2 
= (2 :) e? / / * e-OZiadntr) dr, dir, (32.8) 
0 12 


This integral is most easily evaluated by regarding it as the mutual 
electrostatic energy of two overlapping, spherically symmetric, charge 
distributions, in which case simplifications from the theory of electro- 
statics can be introduced. 

A more general way of performing the integration, which can also 
be used for wave functions that are not spherically symmetric, consists 
in expanding 1/ry2. in spherical harmonics. 


| 
| 

Fhe 

wr 


Te t 
(2) P,(cos @) m1 > Pe 
5 (32.9) 


I 
31s 


ih448 


=) 


T1 t 
(2) P.(cos @) mr <1 
Te 


Tig 


where @ is the angle between ri and fe, cos 6 = cos 6; cos 0, + sin 6; 
sin 02 cos(¢1 — $2), and 41, ¢1 and 62, $2 are the polar angles of the vectors 
r; and re, respectively.! It can be shown? that 


P,(cos 6) = P;(cos 6:)P;(cos 42) 
l 
+2 > aie (cos 6:)P;"(cos 02) cos m(¢1 — ¢2) (32.10) 
mal 


When (32.9) and (32.10) are substituted into (32.8) and use is made 

of the orthogonality of the spherical harmonics, the integration over the 
polar angles of r, causes all terms to vanish except that for which | and 
m are zero. The integral on the right side of (32.8) becomes 


(47)? CR ~ e—(2Z /a9) (ritra) po? dre + i = @7 (22 1aq) (rit rs) p92 ars) ry dr, 
0 


which can be evaluated as 57?a,°/8Z5. Thus the alecteon interaction 
energy has the expectation value 5e?Z/8ap. 


1 Equations (32.9) follow at once from the generating function (14.10) for the Legendre 
polynomials; the expression for cos 6 is simply obtained from the scalar product of 
the vectors ri and re in rectangular coordinates. 

2 E. T. Whittaker and G. N. Watson, ‘A Course of Modern Analysis,” 4th ed., p. 328 
(Cambridge, London, 1935). 


APPROXIMATION METHODS FOR BOUND STATES 259 


VARIATION OF THE PARAMETER Z 


We now have the result that the expectation value of the hamiltonian 
(32.6) for the trial function (32.7) is 
272 2 2 
(H) = e?Z? _ 4e Z , 5e°Z 


ao ao 8ao 


= © (a — 33) 
~-2@-% 


Differentiation with respect to Z shows that this is a minimum when 
Z = 2} = 1.69. Thus the lowest upper limit for the ground-state energy 
of the helium atom obtainable with this trial function is 


carr” = 9.95% 
(94)? 2.85 


The experimental value for the minimum energy required to remove both | 
electrons from a helium atom is 2.904e?/ao, so that our limit is about 1.9 
percent high. The most careful variation calculation of the ground-state 
energy of helium gives a result in excellent agreement with experiment’ 
and provides an important verification of the theory of quantum 
mechanics. 

The result that hydrogenic wave functions give the best energy 
value when Z = 23 rather than 2 indicates that each electron screens 
the nucleus from the other electron, the effective nuclear charge being 
reduced by +5; of an electronic charge. 

If the electron interaction term e?/ri2 is regarded as a perturbation, 
the first-order perturbation energy is given by (H) with Z = 2 and is 
—2.75e?/ao, which is 5.3 percent above the experimental value. It is 
apparent that, in general, the first-order perturbation calculation is 
equivalent to a nonoptimal variation calculation. 


VAN DER WAALS INTERACTION 


As our second example of the application of the variation method, we 
calculate the van der Waals (long-range) interaction between two hydro- 
gen atoms in their ground states. It is convenient to consider this prob- 
lem first by means of the perturbation theory, since it is then easier to 
see that the leading term in the energy at great separation distances varies 
inversely as the sixth power of this distance. Also, it turns out that the 
perturbation theory and the variation method provide opposite limits for 
the coefficient of this term. 

We assume that the nuclei of the two hydrogen atoms, A and B, are 
fixed in space a distance R apart and that the z axis is chosen parallel to 
the line through A and B. Then if r; is the vector displacement of elec- 
tron 1 from nucleus A and r, is the vector displacement of electron 2 


1. A. Hylleraas, Z. Physik 65, 209 (1930). J. Sucher and H. M. Foley, Phys. Rev. 
96, 966 (1954), discuss a number of corrections and give references to more recent work. 


260 QUANTUM MECHANICS 


z axis 
preteanteersencnanae tipi 


A R B 


Fig.27. Two hydrogen atoms, with nuclei at A and B separated by a distance 
R, have electrons at 1 and 2; their interaction is given by H’ in Eq. (32.11). 


from nucleus B (see Fig. 27), the hamiltonian for the two electrons can 
be written 


(32.11) 


The unperturbed hamiltonian H» has the solution 
Uo(¥1,02) = Uro0(¥1) U100(F2) 


for two noninteracting hydrogen atoms in their ground states. We 
regard the interaction terms H’ as a perturbation; this is equivalent to 
assuming that R > apo. 

Since we are interested in the leading term in the interaction energy 
when £# is large, we expand H’ in powers of 1/R and keep the lowest 
terms. 


if: (x2 — 41)? + (y2 — yi)? + (22 — 21)? [4 
R? 
221 ry? —} 222 Te” 3 
~(1 - B+) ~ (14343) 
2 
rE} a (1122 + Yuy2 — 22122) (32.12) 


The last term is the interaction energy of two electric dipoles that corre- 
spond to the instantaneous configurations of the two atoms.! 
It is apparent at once that the expectation value of the leading 


1 The neglected terms in the expansion (32.12) that vary like 1/R* are the dipole- 
quadrupole interaction; the 1/R*® terms are the quadrupole-quadrupole interaction, 
etc. 
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term in H’ for the state uo(ri,f2) is zero, since wo is an even function 
of r, and r, and H’ is an odd function of r; and r2 separately. It can 
also be shown that all the neglected higher terms in H’ have zero expecta- 
tion value for uo, since these terms can be expressed as spherical harmonics 
of order different from zero. Thus the leading term in the interaction 
energy is the second-order perturbation of the dipole-dipole term, which 
is proportional to H’? and hence varies like 1/2R°.* 


PERTURBATION CALCULATION 


From Eq. (31.11), the second-order change in the energy of the two 
hydrogen atoms is 


(O|H’|n)|? 
Eo — En 


where the index n refers to all states of the pair of unperturbed hydrogen 
atoms (including dissociated states), and the ground state uo is excluded 
from the generalized summation. It is apparent that W(R) is negative, 
since E, < E, and the numerator of each term in (32.13) is positive. 
We thus conclude that the interaction is attractive and proportional to 
1/R*, when R is large; both of these conclusions can be shown to be 
valid for any pair of atoms that are in nondegenerate, spherically sym- 
metric ground states. 

We can obtain an upper limit on the positive quantity — W(R) by 
replacing each E, in (32.13) by the energy E,: of the lowest excited state 
of the two hydrogen atoms for which (0|H’|n*) is different from zero.’ 
Then the denominator can be taken outside of the summation, which 
can be evaluated as a matrix product: . 


S.\O|H’|n)|2 = S,(0|H’|nXn|H’|0) — ({0|H"|0))? 
= (0|H’?|0) — ({0|H’|0))? 
Since we have seen that (0|H’|0) = 0, we have that 


S (0|H"(0) 
WR) < ee 


W(R) = Ss. (32.13) 


(32.14) 
The state n* is that in which both atoms are excited to states of 
principal quantum number 2, so that Ey = —2(e?/2da0), En» = —2(e?/8a0), 
and E,+ — Eo = 3e?/4ao. From (32.12) we have 
4 
aS Rs (xy222? + yr?yo? + 42222? + Qezayiy2 — -° +) (82.15) 


1F, London, Z. Phystk 68, 245 (1930). 
2 A. Unséld, Z. Physik 48, 563 (1927). 
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The expectation value of the cross-product terms like xir2y1y2 is zero 
since these terms are odd functions of one of the cartesian components 
of r, or rx. The first three terms in the parenthesis of (32.15) each 
lead to a product of two identical factors that are equal to 


Jx?|uroo(r) |? d’r = $fr?|uroo(r) |? d?r 
nea | © kp—2r] 2 se 
=a ho ree—*rladprr? dr = ag 


so that (0|H’?|0) = 6e‘ao4/R®. Substitution into (37.14) gives 


8e7ay 5 


W(R) => - Re (32.16) 


VARIATION CALCULATION 


An upper limit on W(R) can always be obtained by the variation method. 
It is apparent, however, that some judgment must be used in the choice 
of the trial function y; thus, if y does not depend on R, the dependence of 
the expectation value of the energy on F will be like that of H’, that is, 
1/R*. An upper limit with this R dependence is of no value to us, since 
what we want to determine is a limit on the coefficient of the 1/R® 
interaction. A useful choice for y will be one in which there is a term 
proportional to H’, since there will then be terms in the expectation 
value that are proportional to H’? and hence vary like 1/R°®. 
We choose for the trial function 


W(F1,82) = U100(¥1)U100(F2) (1 + AH’) 


where A is to be the variation parameter. Since this y is not normalized, 
we use (32.5) rather than (32.4) and obtain 


Sfucl + AH’)(Hy + A’)uo(1 + AX’) dr, dir, 
Bot WB) Ss Thue + AHS Ory Ore 
(32.17) 
where again wo is the product of the ground-state hydrogen wave func- 
tions, and A is assumed to be real. The right side of (32.17) can be 
written 
Eo + 2A(0|H’2|0) + A2(0|H’HoH’|0) 
1 + A*0|H’2/0) 
since uo is a normalized eigenfunction of Ho with the eigenvalue 
Eo = —e?/ ao 


and (0|H’|0) = (0|H’?|0) = 0. It is easily seen that (0|H’HoH’|0) is a 
sum of squares of factors of the form Jfuoo(r)xHoxuioo(r) d?r; this can 
be shown by direct computation to be zero. 


(32.18) 
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Since we are interested only in terms of order H’?, we expand the 
denominator of (32.18) to obtain 
(Eo + 2A(0|H’?|0))(1 + A%0|H’?0))~* 
= Ey + (2A — E,A?)(0|H"2(0) (32.19) 
If we remember that Ep is negative, we find that (32.19) has a minimum 


with respect to variation of A when A = 1/Zo, in which case (32.17) 
becomes 


(0|H’?|0) 6e7a9° 
Ey + W(R) < B+ OED = By pe 


(32.20) 


Thus in (32.16) and (32.20) we have both upper and lower limits on the 
interaction energy: 


8e7ao5 6e7ay5 

ee WR) S ~ ee 

More careful variation calculations have shown that the numerical 
coefficient in W(R) is very nearly 6.50." 


(32.21) 


33CJALTERNATIVE TREATMENT OF THE PERTURBATION SERIES 


We have seen in Sec. 32 how the expression (31.11) for the second-order 
perturbed energy can give a useful limit even when the generalized sum- 
mation S$ cannot be carried out. In some situations, however, closed- 
form expressions for W: and ¥, both of which otherwise involve infinite 
summations, can be obtained. We illustrate this first with the problem 
of the second-order Stark effect of a hydrogen atom in its ground state 
and then generalize the procedure to a wider class of situations. 


SECOND-ORDER STARK EFFECT IN HYDROGEN 


The ground state of a hydrogen atom is nondegenerate, and the first- 
order perturbed energy in a uniform external electric field is zero. Our 
problem then is to calculate Eq. (31.11): . 


_ ev KOLA’ In)? 
W2=S, Eo = B. 


1 See L. Pauling and E. B. Wilson, Jr., “Introduction to Quantum Mechanics,’’ sec. 
47a (McGraw-Hill, New York, 1935). The result (32.21) is not strictly correct, since 
H’ in (32.12) included only the static dipole-dipole interaction between the two atoms. 
In reality, there is also an effect of retardation, which arises from the finite speed of 
propagation of the electromagnetic interaction between the two dipoles. This causes 
W(R) to fall off like ~1/R7 when R is large in comparison with the electromagnetic 
wavelength associated with an atomic transition frequency: R > heao/e? = 137ao. 
At such large distances the interaction is uninterestingly small, so that (32.21) actually 
provides useful limits on W(R). See H. B. G. Casimir and D. Polder, Phys. Rev. 73, 
360 (1948). : 


(33.1) 
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The ground state ket |0) in the coordinate representation is 
{r|0) = Uro0(r) = (rao?)—te-"/%, 


and Ey = —e?/2a. Equation (33.1) and the variation method are 
used in Prob. 9 to obtain lower and upper bounds, respectively, for W2. 

Instead of working directly with Eq. (33.1), we shall first find y1 
by solving the second of Eqs. (31.4) subject to the condition (31.6), and 
then make use of (31.7).1. With H’ given by (31.26), Wi = 0, and the 
second of Eqs. (31.4) is an inhomogeneous differential equation for y1: 


Bg Ot E _ 
(- om v2 — nl .) ¥i = —eEr cos @ u100 (33.2) 
We show first that the only angle dependence of y; is through a multiply- 
ing factor cos 6. One way of seeing this is to expand y as a series of 
functions of r times spherical harmonics in @, ¢ and to note that each term 
is an eigenfunction of the angular part of V?. Then the left side of (33.2) 
is a similar series of spherical harmonics, and the only term we wish to 
retain is that which has the same angular dependence as the right side: 
Y10(9,¢) or cos 6. An alternative way of seeing the same thing is to note 
that ¥: given by (31.9) is a sum over only those unperturbed states u, for 
which a, fails to vanish; in accordance with (31.10), this means that up, 
is proportional to cos @ since the unperturbed state is spherically sym- 
metrical. Thus each term in the series for ¥1 is proportional to cos 8, 
and hence y is also. 
We can thus write 


¥i(t) = f(r) cos 0 (33.3) 


and Eq. (31.6) is automatically satisfied. Substitution of (33.3) into 
(33.2) gives 

af  2df 2 1, 2 ie 

ar? + r ar ait + ae” aa = €Go (way)! re (33.4) 
The solution of Eq. (33.4) is expected to have the form of a power series 
in r multiplied by e~’/%; further, the series is expected to start with the 
first or higher power of r, since otherwise (33.3) will be singular at the 
origin. It turns out that the series terminates after two terms, so that the 
solution of (33.4) is 


Jr) = —(ras)-+E (aur + Hr)e-7 (33.5) 


1 This approach appears to have been first published by M. Kotani, “Quantum 
Mechanics,” vol. I, p. 127 (Yuwanami Book Co., Tokyo, 1951). 
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as may be verified by substitution. Thus the wave function that is 
correct through first order in E is 


(wa?) te! 20 E — : (aor + ir) cos | (33.6) 


Substitution of the expression for ¥1 obtained from Eqs. (33.3) and 
(33.5) into (31.7) gives for the second-order perturbed energy 


W. = eE(xao*)—tfr cos? Of(r)e*/* d3r 
2 fe 
oS = if (aor? + 4r5)e~?r/% dr 
0 


—— 9 Eas? (33.7) 


This expression for the second-order Stark effect was first obtained by 
separation of the wave equation in parabolic coordinates.’ 


- POLARIZABILITY OF HYDROGEN 


It was remarked at the end of Sec. 31 that the second-order Stark effect 
could be interpreted in terms of an induced electric dipole moment. 
This induced moment is proportional to the applied electric field and in 
the same direction, and the ratio a of dipole moment to field strength 
is called the polarizability. It is easily seen that these conditions hold 
exactly for a charged isotropic harmonic oscillator and that the energy 
change in this case is —}aE? (Prob. 1). For a general system, in which 
the energy change is not exactly proportional to E?, it is still true that 


W, = —4ak? (33.8) 
Comparison of Eqs. (33.7) and (33.8) shows that 
a = $a? (33.9) 


for a hydrogen atom its ground state (see Prob. 12). 


1G. Wentzel, Z. Physik 38, 518 (1926); I. Waller, Z. Physik $8, 635 (1926); P. S. 
Epstein, Phys. Rev. 28, 695 (1926). The general expression for the first- and second- 
order Stark effect with nuclear charge Z, in terms of the parabolic quantum numbers 
of Sec. 16, is 3n(ni — nz) (eEao/Z) — (E%ac?/16Z*)n4{17n? — 3(ni — nz)? — 9m? + 19]. 
It should be noted that such a series in powers of E cannot converge since, strictly 
speaking, the system has no bound states. This is so because the electron of a hydro- 
gen atom placed in a uniform electric field can gain enough energy by moving in the 
direction —E to compensate for its binding energy. In other words, the electron can 
tunnel through the potential barrier created by the nuclear coulomb field and the 
external electric field, as discussed in Sec. 17. The rate of spontaneous dissociation 
for a hydrogen atom in an electric field was calculated by J. R. Oppenheimer, Phys. 
Rev. $1, 66 (1928), and is exceedingly small for fields of laboratory strength. Even 
though the series for the perturbed energy in powers of E does not converge, it is 
useful for ordinary field strengths. 
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METHOD OF DALGARNO AND LEWIS 


The foregoing procedure can be generalized in the following way.! We 
start with Eq. (31.11), which is applicable to the ground state of any 
system since in all known cases this state is nondegenerate: 


_ 1 LH Inn") 


ook (33.10) 
Suppose now that an operator F can be found such that 
pe = (rio) (33.11) 


for all states n other than the ground state. Substitution into (33.10) 
then gives 
Ws = SiO|H'|n)(n|F\0) = (0|H'F|0) — (0|H’|0)0|F|0) (33.12) 


where the term n = 0 has first been added in to make the summation 
complete and then subtracted out. Thus, if F can be found, the evalu- 
ation of W: is greatly simplified, since only integrals over the unperturbed 
ground-state wave function need be evaluated. 

Equation (33.11) can be written as 


(n|H'|0) = (Eo — E,){n|F|0) = (n|[F,H0]|0) 
which is evidently valid if F satisfies the operator equation 
[F,Ho] = H’'+C 


where C is any constant. However, this last equation is unnecessarily 
general; it is enough that F satisfy the much simpler equation 


[F,H0]|0) = H’|0) + C0) (33.13) 


from which it follows that C = — (0|H’|0). 
We now define a new ket |1), which is the result of operating on |0) 
with F. Then Eq. (33.13) may be written 
(Eo — Ho)|1) = H'|0) — (0|H’}0)|0) where |1) = F/O) 
(33.14) 
The ket |1) can evidently have an arbitrary multiple of (0) added to it; we 
choose this multiple so that (0/1) = 0. If now Eq. (33.14), which is an 
inhomogeneous differential equation, can be solved for |1), the second- 
order perturbed energy (33.12) can be written in terms of it as 
= (0|H"|1) (33.15) 


1A, Dalgarno and J. T. Lewis, Proc. Roy. Soc. (London) A238, 70 (1955); C. Schwartz, 
Ann. Phys. (N.Y.) 6, 156 (1959). 
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In similar fashion the series (31.9) for 1 can be written in closed form: 


= 5, EY = SiImynlF0) 
= Flo) ~ loXolF0) = |1) (83.16) 


It is apparent that Eqs. (33.15) and (33.16) are consistent with Eq. (31.7), 
as of course they must be. 

The Dalgarno-Lewis method thus replaces the evaluation of the 
infinite summation (31.9) by the solution of the inhomogeneous differ- 
ential equation (33.14). The latter procedure may be much simpler even 
when it cannot be done in closed form, as with (33.4). 


THIRD-ORDER PERTURBED ENERGY 

The ket |1) = F|O) is all that is needed to find the third-order perturbed 
energy W3. We make use of Eas. (81.7), (31.12), (81.13), and the com- 
plex conjugate of (33.11) to write 


= (uo, H’p2) 
= st MOH 'Ik) ( » (k|H’|n)n|H’|0) ee) 
hoo BeX Ey — E, Ey — Ex 


= SiO|Ft|k)(SikIH’ InXin|F|0) — (kF|0X0|H"|0)) 
= (O|F'H'F\0) — (0\F*|0X0|H’F|0) — (0\FH’|0X0|F|0) 
— (O|FF|0X(0|H’|0) + 2(0|F10X0|H’|0)(0|F|0) 
= (1|H"|1) — (1|1)0|H7"|0) (33.17) 


since (0|1) = 0. We thus obtain a closed expression for W; as well. 


INTERACTION OF A HYDROGEN ATOM AND A POINT CHARGE 


As an example of this method, we now calculate the change in energy of a 
hydrogen atom in its ground state when a point charge Ze is placed at a 
fixed distance &. The perturbation is © 

| 

R (R? + r? — 2Rr cos 6) 


2 1 . 
a: -) (z) P,(cos 6) (33.18) 


provided that R > r or, equivalently, that R is much greater than do. 


1 This result can also be obtained directly from Eqs. (31.4) and (31.6) as a special case 
of the formula derived in Prob. 14. 
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From the structure of Eq. (33.14), we expect that the coordinate 
representation of the ket |1) can be written in the form 


{r|1) = 2 fi(r)Pi(cos 6) . (33.19) 


Substitution of (33.19) into (33.14) leads to the following differential 
equation for f,(r): 


af, | 2dfr_ Ul+1) oe 2 en 2 ; 
drt dr eT edt aad = ~ aR rag” 
(33.20) 


—1r | ag 


As expected, this agrees with Eq. (33.14) when we put / = 1 and E = 
—Ze/R*. 

A solution of Eq. (33.20) is easily found in analogy with (33.5) and 
again contains only two terms. Substitution into (33.19) gives 


pitl 


(r|1) = > am msicams (s + ir) e-lP,(cos 6) (33.21) 


which, in accordance with (33.16), is equal to ¥:(r). Similarly, Eq. 
(33.15) shows that W: is given by 


w (d+ 2)(21 + 1)! ao?! 
— 9gaitr Rt 


= (0|H’|1) = —Z?e? (33.22) 


t=1 


Again, the leading term (J = 1) agrees with (33.7) when E = —Ze/R?. 

It should be noted that, although Eq. (33.22) gives the first two 
terms of an asymptotic series in 1/R correctly, the third term, which is 
proportional to 1/R8, is dominated by the leading term of W3. Equation 
(33.17) shows that W; = (1|H’|1) in this case and that the leading term 
for large & is proportional to 1/R7 (see Prob. 15).} 


34CTHE WKB APPROXIMATION 


In the development of quantum mechanics, the Bohr-Sommerfeld quanti- 
zation rules of the old quantum theory (Sec. 2) occupy a position inter- 
mediate between classical and quantum mechanics. It is interesting that 
there is a method for the approximate treatment of the Schrédinger wave 


1A. Dalgarno and A. L. Stewart, Proc. Roy. Soc. (London) A238, 276 (1956). It 
should be noted that, unlike the situation with the van der Waals interaction discussed 
in the preceding section, there is no correction arising from retardation in the present 
problem. This is because the only motion is that of a single electrons in the electro- 
static potential of two fixed charges. 
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equation that shows its connection with the quantization rules. It is 
based on an expansion of the wave function in powers of i, which, although 
of a semiconvergent or asymptotic character, is nevertheless also useful 
for the approximate solution of quantum-mechanical problems in appro- 
priate cases. This method is called the Wentzel-Kramers-Brillouin or 
WKB approximation, although the general mathematical technique had 
been used earlier by Liouville, Rayleigh, and Jeffreys. It is applicable 
to situations in which the wave equation can be separated into one or more 
total differential equations, each of which involves a single independent 
variable. 


CLASSICAL LIMIT 
A solution y(r,t) of the Schrédinger wave equation (6.16) 


oy 
th a SS ave tT Vir)y 


can be written in the form 


vir,t) = A exp —,;—— W (et) 


in which case W satisfies the equation 


one — (VW)! + V — VW = =0 (34.1) 


In the classical limit (4 — 0), Eq. (34.1) is the same as Hamilton’s partial 
differential equation for the principal function W:? 


om + H(r,p) = 0 p= VW 


Since the momentum of the particle is the gradient of W, the possible 
trajectories are orthogonal to the surfaces of constant W and hence, in the 
classical limit, to the surfaces of constant phase of the wave function y. 


1It is sometimes called the BWK method, the classical approximation, or the phase 
integral method. For the original work, see J. Liouville, J. de Math. 2, 16, 418 (1837); 
Lord Rayleigh, Proc. Roy. Soc. (London) A86, 207 (1912); H. Jeffreys, Proc. London 
Math. Soc. (2)23, 428 (1923); G. Wentzel, Z. Physik. 38, 518 (1926); H. A. Kramers, 
Z. Physik. 39, 828 (1926); L. Brillouin, Compt. Rend. 183, 24 (1926). For more recent 
developments, see E. C. Kemble, ‘‘The Fundamental Principles of Quantum Mechan- 
ics,’”’ sec. 21 (McGraw-Hill, New York, 1937); R. E Langer, Phys. Rev. 61, 669 (1937) ; 
W. Hz. Furry, Phys. Rev. 71, 360 (1947); S. C. Miller, Jr., and R. H. Good, Jr., Phys. 
Rev. 91, 174 (1953). The treatment of this section resembles most closely those of 
Kramers and Langer. 


2K. T. Whittaker, ‘“‘Analytical Dynamics,’’ 3d ed., sec. 142 (Cambridge, London, 


1927); H. Goldstein, ‘‘Classical Mechanics,’ sec. 9-1 (Addison-Wesley, Reading, 
Mass., 1950). 
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Thus in this limit the rays associated with y (orthogonal trajectories to the 
surfaces of constant phase) are the possible paths of the classical particle. 
If y is an energy eigenfunction u(r)e*#"*, W can be written 
Wir,t) = S(r) — Et 
In this case, we have that 


aS (r) 
h 


1 2_ih — thors 

Of (VS) [E — V(x)] on VS = 0 
(34.2) 

The WKB method obtains the first two terms (one term beyond the 

classical expression) of an expansion of S in powers of hf, in the one- 

dimensional case. 


u(r) = A exp 


APPROXIMATE SOLUTIONS 
The basic equation that we consider is written in one of the forms 


d*u 


ut eu=0 b> 0 (34.3) 
| 
a Cee: 2>0 (34.4) 


so that k and « are always real. These are equivalent to the one-dimen- 
sional wave equation (8.5), if we put 


Read: ; {ul — V(z)]}+ when V(t) < E 
(34.5) 
x(x) = + 5 {2u[V(2) — E]}} when V(x) > E 


Equations (34.3) and (34.4) are also equivalent to the radial wave 
equation (19.2) if x is replaced by r, V(r) is replaced by 


RL + 1) 


V(r) + Dart 


and u is equal to r times the radial wave function. 

We restrict our attention for the present to Eq. (34.3); we shall be 
able to generalize the resulting expression for u(x) to obtain solutions of 
(34.4). We put 


which on substitution into (34.3) gives the one-dimensional form of (34.2) 
aS” — 8’? + hk? = 0 (34.6) 


where primes denote differentiation with respect to z. 
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We substitute an expansion of S in powers of f into (34.6) and equate 
equal powers of h. 


S — So + AS, + aca 
—So + 2u(E — V) =0 
iSo — 2858, = 0 ete. 


Integration of these equations gives 
So(z) = +h [* ke’) de" Si(z) = Mn (2) 


where ‘arbitrary constants of integration that can be absorbed in the 
coefficient A have been omitted. We thus obtain to this order of 
approximation 


u(x) = Ak-texp (+i J * dz) V<E (34.7) 
In similar fashion, the approximate solution of (34.4) is 


u(z) = Br-texp (+ J ” « dz) V>E (34.8) 


ASYMPTOTIC NATURE OF THE SOLUTIONS 


The accuracy of these WKB solutions can be gauged by comparing the 
magnitudes of the successive terms Sy and /S; in the series for S. Since 
So is a monotonic increasing function of x so long as k does not vanish, the 
ratio 4S:/So is expected to be small if AS;/S, is small. We thus expect 
(34.7) to be useful in that part of the domain of x where 


AS, | | kh’ 
S| = | ae <1 (34.9) 
The local de Broglie wavelength \ is 21/k, so that (34.9) can be written 
 |dk 
ax = <k 


which means that the fractional change in k (or the wavelength) in the 
distance \/47 is small compared with unity. Thus the WKB solutions 
are useful when the potential energy changes so slowly that the momen- 
tum of the particle is sensibly constant over many wavelengths. 

The same criterion is obtained for (34.8) if we now mean by the 
“wavelength” the distance in which the magnitude of u changes by a 
factor e?*. 

It is apparent that the condition (34.9) is violated near the turning 
points of the classical motion, where V(r) = E, k and « are zero, and the 
“wavelength” is infinite. Thus the solutions (34.7) and (34.8) are 
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asymptotically valid in the sense that they can be used several wave- 
lengths from the nearest turning point if, as is usually the case, the wave- 
length there is slowly varying. 

The asymptotic solutions are most useful if we know how to connect 
an oscillating solution like (34.7) to an exponential solution like (34.8) 
across a turning point. It is in this way, for example, that we apply 
boundary conditions and obtain energy eigenvalues. The derivation of 
such connection formulas, which we consider next, is of interest for this 
reason. However, it should be noted that many applications of the WKB 
approximation do not depend critically on the precise way in which the 
connections are made. 


SOLUTION NEAR A TURNING POINT 


The wave equations (34.3) and (34.4) are regular at a turning point, so 
that there is a solution that is analytic there and has asymptotic forms 
like (34.7) and (34.8). Such a solution usually cannot be written in 
closed form. The wave equation can, however, be modified slightly so 
that an exact solution that has the desired asymptotic forms can be 
obtained. 

We can, without loss of generality, take the origin of xz at a particular 
turning point; we also assume for the moment that V(x) < FE to the right 


of the turning point (positive z) and put é&(7) = i kdz. Now if 
k(x) = Cz”, where C is a positive constant, Eq. (34.3) is known to have 
the solutions 


1 
= 4 —4 = 
u(x) = Aétk-*Jam(') M= 779 (34.10) 
where J is a Bessel function; this can be verified by direct substitution. 
The asymptotic form of J is such (see below) that (34.10) agrees asymp- 
totically with (34.7). 
We therefore try to retain this form by rewriting (34.3) with an 


additional term 6(x) even when k?(z) is not a power of z: 
2 
a + (k? — @)u=0 (34.11) 


Substitution of (34.10) into (34.11) shows that the new equation is 
satisfied if we define @ as 

eek ie gee 
We expand k? as a power series in 2: 


k?(z) = Car(1 + ax + br? + ---) 
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in which case 6 can also be expanded in a series. The 1/z? and 1/z terms 
vanish, and the leading term is independent of z. 
3(n + 5)a? 3b 

%) 2 Batam +e) x46 Bet) 

We can now see that (34.11) is a good approximation to the actual 
wave equation (34.3). The similarity in structure between each of the 
three terms in (34.12) and the asymptotic accuracy criterion (34.9) 
indicates that @< k? in the asymptotic region if the WBK method can 
be used at all. At and near the turning point, @ is not negligible in com- 
parison with k?, since @ is a constant and k? vanishes atx = 0. However, 
(34.13) shows that 6(0) is quite small, being of second order in the devia- 
tion of k? from the simple form Cx". Thus for potential functions V(z) 
that are slowly varying, (34.10) is expected to be a good approximation 
to the actual solution of Eq. (34.3). 


LINEAR TURNING POINT 


We now specialize to the situation of greatest physical interest, in which 
n= 1. A typical linear turning point is shown in Fig. 28; Eq. (34.3) is 
used in region 1 (t > 0), and Eq. (34.4) in region 2 (x < 0). We put 


i= ik kdz, i= i x dz, so that both & and é. increase as x moves 


away from the turning point; this makes it easy to generalize the results 
to situations in which the regions 1 and 2 are interchanged. The two 
independent solutions in each of the two regions are 


ut(z) = Ashik 44(1) 
Ust(x) = By kode*.,(€2) 


It is evident that we must replace J by J, the Bessel function of imaginary 
argument, in region 2. 


(34.14) 


Fig. 28 A typical linear turning point, where V(z) = E at 
z = 0;inregion 1, Z > V(z), and in region 2, E < V(z). 
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We require the leading terms of the power series expansions and of 
the asymptotic expansions for these functions!: 


4 

Tl) Sy tetp 3) 
= (Ar £1)-? cos (« + ; = i) (34.15) 

(gé2)+# 


T44(&2) —> me FL ED) 
——> (2uEs)-4 (et + e- He Gt dn) 


It is important to note that the term e— in the asymptotic expansion 
for I can be retained only when a combination of solutions 7,, is chosen 
such that the coefficient of e& is zero. This is because other terms in the 
asymptotic expansion, such as e/£2, have been neglected, and these are 
of larger order of magnitude than e~*. The asymptotic nature of the 
WKB approximation is such that, if the term that increases exponentially 
away from the turning point is present, it is impossible to say whether or 
not the decreasing exponential term is also there. 


CONNECTION AT THE TURNING POINT 


The leading term in k? at x = 0 is Cz, so that k ~ cz}, « = ela}, & = 
(2c/3)x!, & ~ (2c/3)\x|!, where c = +Ct. Then from (34.14) and 
(34.15) we obtain the behavior of the u’s near z = 0: 


4. @iGOH 


(3)8(ge)# a 
+ x fat JAB ee a ro 
Senge? | ee r@) 
@)#Ge)# p_ &iGe+ 
fe 2s 3 B_ si hsev 
MBE gy Magy 
It is apparent then that uit joins smoothly on to ust if By = —A, and 


that ui- joins smoothly on to uz if BL = A-. 

These relations between the coefficients can be used to obtain asymp- — 
totic forms like (34.7) and (34.8) for the two independent solutions ut and 
u~ in the two regions (the arbitrary multiplying constants A, are omitted). 


br 
+ —} ee oe 
ae (2rk)-* cos G 5) 


PERT _— (2ax)—*(e® + e~ fb ri/6) 
(34.16) 


eae ee («: . 7) 
=> (Qa) (e® + e~ fx—71/8) 


1 Whittaker and Watson, op. cit., chap. 17. 
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The asymptotic forms of any linear combination of ut+ and u- can be 
found from Eqs. (34.16). 


ASYMPTOTIC CONNECTION FORMULAS 


Convenient connection formulas between the asymptotic WKB solutions 
in the two regions can be obtained by choosing suitable linear combina- 
tions of ut and u-. Thus the combination ut + u~ contains only the 
decreasing exponential and yields the first connection formula 


$x te & — k-t cos (£1 — 42) (34.17) 


The arrow in (34.17) implies that the asymptotic solution in region 2 that 
appears on the left goes into the asymptotic solution in region 1 that 
appears on the right but that the converse is not necessarily true. This 
is because a small error in the phase of the cosine introduces the dominant 
increasing exponential in region 2.! 

Another linear combination of u+ and u- can be found that gives the 
second connection formula 


sin nxte& <— k-* cos (1 — 379 + 7) (34.18) 


where 7 is appreciably different from zero or an integer multiple of 1. 
The arrow in (34.18) appears since the neglected decreasing exponential 
in region 2 alters the phase of the cosine in region 1 by an indeterminate 
amount if the connection is reversed. 


ENERGY LEVELS OF A POTENTIAL WELL 


We now give a simple example of the application of the WKB approxima- 
tion that serves as a derivation of one of the Bohr-Sommerfeld quantiza- 
tion rules. We wish to find the energy levels of a particle moving in the 
one-dimensional potential well shown in Fig. 29. For any assumed 
energy level E, there are supposed to be just two turning points of the 
classical motion such that 


Vivi) = Vv.) = # 


The regions x < 21; and x > 22 are type 2 regions in which we know that 
u decreases away from the turning points in order to satisfy the boundary 
conditions at +o. Thus we have only the decreasing exponential WKB 
solution in these regions. 


1 The converse of (34.17) can be used in the following sense: If some parameter in 
the solution (such as the energy £) is varied continuously so that the phase of the 
cosine in region | passes through the value —+z, the increasing exponential in region 2 
disappears for some indeterminate value of the phase close to —}7 and leaves only the 
decreasing exponential. This result is useful, for example, in treating the resonance 
scattering of alpha particles by a heavy nucleus. 
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V(x) 


Region2 1 Region 1 x2 Region 2 x 
| 


Fig. 29. Application of the WKB method to a potential trough; linear turning 
points occur at x; and 22. 


The connection formula (34.17) can be applied at the turning point 
21, which separates a type 2 region from the type 1 region 71 < t < 22. 
The only change is that the lower limit on the é; integral is changed from 
0 to 21, so that the solution to the right of the turning point is 


k- cos ( ih "kde — in) (34.19) 


apart from an arbitrary multiplying constant. The same connection 
formula can also be applied at x2 by reversing the direction of the xz axis 
and changing the fixed limit on the ¢ integrals from 0 to 22; the arrow in 
(34.17) still means that we go from a region 2 solution to a region 1 solu- 
tion, but now the latter is to the left of the turning point and the former 
is to the right. We redefine {: = i: Ie dx, fg = x dx so that they still 
increase going away from the turning point, in which case (34.17) can be 
used without any modification. The solution to the left of this turning 


point is then k~ cos ( i "kd — in), which can be written 


i+ cos ( "kde — 4 — 1) n= [kde — (34.20) 


As was the case in the qualitative discussion of discrete energy eigen- 
values in Sec. 8, we obtain the energy levels of this system by requiring 
that the two solutions (34.19) and (34.20) join together smoothly in the 
interior of region 1. This evidently requires that 7 be zero or a positive 


integer multiple of 7, since f; s k dz is necessarily positive. We can write 
the determining equation for the eigenvalues as 
_, kde = (n+ 4)e n=0,1,2,... (34.21) 


Equation (34.21) is to be used for values of n up to the point at which E 
becomes so large that one or both of the turning points disappears. 


APPROXIMATION METHODS FOR BOUND STATES 277 


A QUANTIZATION RULE 


The expression (34.5) for k can be substituted into (34.21) to give one of 
the Bohr-Sommerfeld quantization rules of the old quantum theory: 


2 f° (2ulE — V(2)]}4de = (n + 4)h (34.22) 


The left side of (34.22) is the integral around a complete cycle of the 
motion (from 2; to x2 and back to x1) of the momentum [2u(H — V)j}. 
The right side is the quantum value of the phase integral, with half- 
integer rather than integer quantum numbers. 

It is easily seen from the form of the solution (34.20) that ; n is the 
number of nodes of the WKB wave function between the turning points. 
Since it is basic to the WKB method that we can develop asymptotic 
solutions like (34.7) only several wavelengths from each turning point, 
the approximation should be good only if the turning points are several 
wavelengths apart, or if n is large in comparison with unity. This con- 
firms the earlier view that the WKB method is a semiclassical approxima- 
tion, since it is expected to be most useful in the nearly classical limit of 
large quantum numbers. 

Actually, the WKB approximation also gives quite good results for 
the low quantum states of many systems. For example, if we apply 
(34.22) to the harmonic oscillator V(x) = +Kzx?, it is known from the old 
quantum theory that the correct energy levels happen to be obtained for 
all quantum numbers (see Prob. 16). 


SPECIAL BOUNDARY CONDITIONS 


The boundary condition to be applied to a WKB solution at a perfectly 
rigid wall (V changes discontinuously to + © at x = 20) is that the wave 
function vanishes there. Thus if k (for a region of type 1) is slowly 
varying up to x and other turning points are remote, the asymptotic 
solution can be used and has the form 


k-4 sin ( i k dz: ) 


Similarly, for a finite potential step that is far from other turning 
points, the asymptotic WKB solutions can be used up to the point of 
discontinuity of V if k or « is slowly varying. Then the magnitudes and 
slopes of the solutions on the two sides can be matched at this point. 

As pointed out. after Eq. (34.5), the WKB method can be applied to 
the radial wave equation for a spherically symmetric potential. When 
[ = 0, the radial wave function must be finite at r = 0, and so u must 
vanish there. If k or « is slowly varying there, the asymptotic solutions 
can be used; for example, if FE — V(r) is positive, finite, and slowly vary- 
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ing at and near r = 0, the solution is k~? sin ( ff kar). When the 


effective potential energy is infinite at r = 0, either because V itself is 
infinite or because of the centrifugal-force contribution for | ~ 0, the 
situation is more complicated and requires further investigation. 


TUNNELING THROUGH A BARRIER 


Figure 30 shows a typical potential barrier, drawn as a function of the 
radial coordinate; as indicated below Eq. (34.5), the centrifugal term is 
included. In the type 2 region, the wave function is a real exponential 


of the form (84.8). Thus if f " « dr is appreciably larger than unity, the 


behavior of the solution is dominated by the large ratio of the wave 
function at the two turning points. We call the corresponding probabil- 
ity ratio, which is the square of the wave-function ratio, the barrier 
penetration factor P: 


P = exp [ -2 ie k(r) ar | where 


2 
dr) = +5 {20 | Ve a ae = z| (34.23) 
Tunneling through a square potential barrier, described at the end of 
Sec. 17, is a special case of this. It is easily seen that the dominant 
exponential factor in Eq. (17.8) is just the P given by (34.23). 
In radioactive a decay, V(r) = +ZZ’e?/r, where Z is the atomic 
number of the final nucleus, and Z’ = 2. We assume that this expression 


l(L+ 1)h2 
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Fig. 30 Application of the WKB method to a potential barrier, with 
the radial coordinate as the independent variable. 
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for the potential is applicable only for r > R; within the nuclear radius R, 
strong attractive forces more than cancel the coulomb repulsion, so that 
= R. Forl = 0, re. = ZZ’e?/E, so that 


rs 3 *¢2 "e2 t 
[Poa ms i = 2i¢ — 2) dr 


h JR 
_ aZZ'e? ee eer a _ wk = 
see ( ~ sin *) 7 (y — 1) (34.24) 


where E = iyv? and y = ZZ'e?/ER is the ratio of the coulomb-barrier | 
height to the kinetic energy of the emerging @ particle. Lifetimes for a 
decay are expected to be of the order of some characteristic time 7, divided 
by the P obtained from Eqs. (34.23) and (34.24). In this case, the wave 
function is much larger at 7; than at re, and 7 is roughly the period of 
oscillation of an a@ particle within the attractive nuclear potential of 
radius R. Withr ~ 10-#! sec and R somewhat less than 10—!? cm, a good 
fit with the observed Z and E dependence is obtained over a range of 
lifetimes of the order of a factor 1074. 

The barrier penetration factor also enters into the rate of nuclear 
reactions. In this case the wave function is much larger at re than at 71, 
and it is evident that P is in agreement with the Gamow factor, defined 
below Eq. (21.14), when R = 0. 

As another example, we consider whether or not a zero-energy 
particle can be bound; this is of interest in connection with the work of 
Secs. 19 and 39. When E = 0, 


[0K ar = Gy) ie | ve ee aoe ote ge | i (34.25) 


If 1 = 0 and V(r) has a finite range, this integral is finite, and P = 0. 
Similarly, if V(r) extends to infinity and falls off faster than 1/r? for large 
r, the integral is finite when r, = ©. On the other hand, if 1 > 0, (34.25) 
is infinite, and P = 0. Thus, as remarked near the end of Sec. 19, 
resonance at zero energy leads to a true bound state if / is not zero. 

As pointed out earlier in this section, the applications of the WKB 
approximation made here do not depend critically on the precise way in 
which the connections across turning points are made. In the derivation 
of the quantization rule (34.22), only the number 3 on the right side 
follows from the use of the connection formulas; the discussion of barrier 
penetration does not depend on them at all. 


350 METHODS FOR TIME-DEPENDENT PROBLEMS 


When the hamiltonian depends on the time, there are no stationary solu- 
tions of the Schrédinger equation. Thus our identification of a bound 
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state with a discrete energy level and stationary eigenfunction must be 
modified. We consider in this section three ways in which this modifica- 
tion can be made, each of which corresponds to a particular kind of 
approximation. The time-dependent perturbation theory, which is some- 
times called the method of variation of constants, assumes as in Sec. 31 that 


H= Ho + H!’ Hou = Exyux (35.1) 


where Hp is simple and H’ is small. However, H’ now depends on the 
time and has the effect of causing transitions between eigenstates of Ho 
that would be stationary in the absence of H’. Alternatively, rather than 
decompose the hamiltonian in accordance with (35.1), the absabatic 
approximation assumes that H contains parameters that change very 
slowly with time. Thus the system is expected to be described approxi- 
mately by means of stationary eigenfunctions of the instantaneous 
hamiltonian. Finally, we consider the sudden approximation, in which H 
is constant in time except for a very short time interval in which it changes 
from one form to another. 


TIME-DEPENDENT PERTURBATION THEORY? 


We must now work with the time-dependent Schrédinger equation 


OW 3 | 
th = = Hy (35.2) 


rather than with the first of Eqs. (31.1). Our procedure consists in 
expressing y as an expansion in the eigenfunctions u,e~*="/* of the unper- 
turbed time-dependent wave equation, where the expansion coefficients 
evidently depend on the time: 


Y= Snda(t)Une tall (35.3) 
Substitution of (35.3) into (35.2) gives 
SnthGntineEnt!® 4+ San EnUneBnt!® = San(Ho + H’)une tent! 


where the dot denotes differentiation with respect to the time. 

We replace Hou, by E,u, on the right side, multiply through on the 
left by uy, and integrate over all space, making use of the orthonormality 
of the w’s: 


thda,e~Etl* = Sndne*Zat/*Ie| H'\n) 
We define the Bohr (angular) frequency 


ie see (35.4) 


1P, A. M. Dirac, Proc. Roy. Soc. (London) A112, 661 (1926); A114, 243 (1927). 
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and obtain 
Gy = (th) S,.(k|H |n)anete! . (35.5) 


The group of Eqs. (35.5) for all k is exactly equivalent to the Schrédinger 
equation (35.2). 


INTERACTION PICTURE 


Two kinds of changes have been made in going from (35.2) to (35.5). 
First, we have changed the representation from being specified in terms of 
the coordinates to being specified in terms of the unperturbed energy 
eigenvalues (see Sec. 23). Second, we have changed from the Schrédinger 
to the interaction picture (see Sec. 24). It is worth looking at these two 
changes in some detail. The Schrédinger-picture ket |as(t)) corresponds 
to the wave function y and has the representation in terms of eigenvalues 
of the unperturbed hamiltonian Ho, 


(klas(t)) = (un) = ax(tje Bet! (35.6) 


where use has been made of Eq. (35.3). The interaction picture ket 
lay(t)) may be found from the first of Eqs. (24.12); in the same repre- 
sentation it is 


(klar(t)) = (kleest!*lars(t)) = (kle*est!*|n)(nlas(t)) 


where S,, is implied on the right side. The first bracket expression on the 
right side is simply e*#+/*(k|n), since Hog is diagonal in this representation. 
Thus, with the help of (35.6), we find that 


(klar(t)) = ax(4) 


The perturbation that appears in (35.1) or (35.5) is given in the 
Schrédinger picture and thus should be designated H's. It may be found 
in the interaction picture from the second of Eqs. (24.12), which in our 
representation is 


(k|Hp|n) = (lele**ost!%3)(j| HT 'g|0X | e~*#st!*|n) 
ef Eatin k| H’,|n)e~*B ath 


(k|H g|n)ete! 


Thus the equation of motion (24.13) for the ket in the interaction picture, 
expressed in the Ho representation, is identical with Eq. (35.5).! 
1 Because this equation was first obtained by Dirac (Joc. cit.) in connection with the 


time-dependent perturbation theory, the interaction picture is sometimes called the 
Dirac picture. 
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FIRST-ORDER PERTURBATION 


We now return to Eq. (35.5), replace H’ by. \H’, and express the a’s as 
power series in \: 


An = On + rag + 2a, +--+ > (35.7) 


As in Sec. 31, we assume that this series is a continuous analytic function 
of \ for X between zero and one. We therefore substitute (35.7) into 
(35.5), equate coefficients of corresponding powers of \, and set \ = 1 in 
the final results. The substitution yields the set of equations 


a = 0 dt = (A)-1S,(K|H’[n)ay eter! (35.8) 


These can, in principle, be integrated successively to obtain approximate 
solutions to any desired order in the perturbation. 

The first of Eqs. (35.8) shows that the zero-order coefficients a, are 
constant in time. Their values are the initial conditions of the problem 
and specify the state of the system before the perturbation is applied. 
We shall assume that all except one of the a, are zero, so that the system 
is initially in a definite unperturbed energy state.! The results that we 
shall obtain can easily be generalized to situations in which more than one 
of the zero-order coefficients are different from zero. 

We thus put a, = (k|m) = dim or 5(k — m), according as the 
initial state m is one of a discrete or continuous set. Integration of the 
first-order equation gives 


a(t) = (any? "(RLU myelin de (35.9) 


where the constant of integration is taken to be zero in order that a,“ be 
zero at t = —o (before the perturbation is applied). If H’ is of finite 
duration, the amplitude of a state u, (k # m) after the perturbation has 
disappeared is proportional to the time Fourier component of the matrix 
element of the perturbation between this state and the initial state that 
corresponds to the angular frequency wim given in (35.4). 


HARMONIC PERTURBATION 


Equation (35.9) takes a particularly simple form if the perturbation H’ 
depends harmonically on the time except for being turned on at one time 
and off at a later time. We call these two times 0 and éo, respectively, and 


1 This need not conflict with the uncertainty relation (3.3), since the infinite lapse of 
time prior to the application of the perturbation makes it possible to determine the 
original energy of the system with arbitrarily great precision. 
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assume that we can write 
(k|H'(t')|m) = 2(k|H’|m) sin wt’ (35.10) 


where (k|H’|m) is independent of the time and w is defined to be positive. 
Substitution into (35.9) gives for the first-order amplitude at any time ¢ 
at or after to 
k|H’|m) et@mto)te = 1 et(@xm—o) to <a l 
a, —— ( ee eae ee 
(2 be) th Wkm + w Wkm — 


(35.11) 


The structure of Eq. (35.11) suggests that the amplitude is apprecia- 
ble only when the denominator of one or the other of the two terms is 
practically zero. The first term is important when wim ~ — or E, = 
E,, — fw, and the second term is important when wm gor E, ~ Ey, + 
hw. Thus the first-order effect of a perturbation that varies sinusoidally 
in the time with angular frequency w is to transfer to or receive from the 
system on which it acts the Planck quantum of energy hw.' This concept 
will be used to treat radiation processes in Chap. 11. 

For the present, we specialize to a situation in which the initial state 
m is a discrete bound state and the final state k is one of a continuous set 
of dissociated states. Then E, > E,, and only the second term in (35.11) 
need be considered. The first-order probability of finding the system in 
the state k after the perturbation is removed is 


4|(k|H’|m)|? sin? 1 (wkm a w)to 


(1) 2= 
laz(t > to)| 1? (Oem — 0)? 


(35.12) 


TRANSITION PROBABILITY 


The factor sin? 4(wim — )to/(wim — w)? is plotted in Fig. 31 as a function 
of wim — w. The height of the main peak increases in proportion to ¢)?, 
and its breadth decreases inversely as to, so that the area under the curve 
is proportional to t. Thus if there is a group of states k that have 
energies nearly equal to E,, + hw, and for which (k|H’|m) is roughly inde- 
pendent of k, the probability of finding the system in one or another of 
these states is proportional to tp. This is the physically interesting situa- 
tion, since what we wish to calculate eventually is a transition probability 
per unit time, w, and this implies that the probability that a transition has 


1 This is related to the reason for the insertion of the factor 2 on the right side of Eq. 
(35.10) in defining the time-independent matrix element (k|H’|m). A perturbation 
that is proportional to sin wt or to cos wt contains both time factors e* and e~** with 
equal amplitude. Since only the time factor e~*“ leads to transfer of energy from the 
perturbation to the unperturbed system, the insertion of the factor 2 ensures that the 
physically important matrix element of the perturbation is (k|H’|m) and not half 
this quantity. 
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Y, ty? 


sin? %(wkm—w) to 
(wkm—w)? 


Fig. 31. The ordinate is proportional to the first-order perturbation probability of 
finding a system in a state of energy Ex = Em + hwikm; the heights of the secondary 
peaks fall off like 1/(wim — w)?. 


taken place after the perturbation has been on for a time ty is proportional 
to ty. 

The spread of energy of the final states to which transitions occur is 
connected with the uncertainty relation (3.3) in the following way. We 
can regard the perturbation H’ as a device that measures the final energy 
of the system by transferring it to one of the states k. The time available 
for the measurement is to, so that the uncertainty in energy predicted by 
(3.3) is of order f#/to; this is in agreement with the breadth of the main 
peak in Fig. 31. It is interesting to note that conservation of energy, as 
expressed by the relation EL, ~ E, + hw and suitably modified by the 
uncertainty principle, is an automatic consequence of the calculation and 
does not have to be inserted as a separate assumption. 

The transition probability per unit time is given by integrating 


1 We assume that the total transition probability to all states k is small enough in 
comparison with unity so that the initial state m is not significantly depleted. This 
is equivalent to the original assumption that the perturbation is small, which means 
that for times to of physical interest there is little change in the initial state. There 
can still be an effect of observable magnitude if a large number of independent systems 
receive identical treatment. 
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(35.12) over k and dividing by to: 
; 
w= > i) lax(t > t)|2o(k) dE, (35.13) 
0 


where p(k) dH, is the number of final states with energies between FE, and 
E, + dE,. The concept of an energy density p(k) of final states is sensi- 
ble, since we are considering the case in which the transition is to one or 
another of a continuous set of dissociated states. We now take advantage 
of the fact that the breadth of the main peak in Fig. 31 becomes small as to 
becomes large, and we regard (k|H’|m) and p(k) as quantities sufficiently 
independent of E; so that they can be taken outside the integral in Eq. 
(35.13). We further simplify the integral by changing the integration 
variable from E; to x = 4(wem — w)to and extending the limits on z to 
+o. Substitution of (35.12) into (35.13) then gives! 


w = 2 p(k) (k|H'|m) (35.14) 


where we have made use of the result / if x? sintadz=-7. This 


expression for w is independent of fo, as expected. 

There may be several different groups of final states ki, ke, ... , 
all of which have about the same energy E,, + hw but for which the 
perturbation matrix elements (k;|H’|m) and the densities of states p(k.), 
although nearly constant within each group, differ from one group to 
another. Then Eq. (35.14), with k replaced by k,, gives the transition 
probability per unit time to the zth group. 

It is apparent that the foregoing treatment fails to give a transition 
probability that is proportional to the time if the final as well as the 
initial state is discrete. In this case, Eq. (35.12) shows that |a,(é > o)|? 
depends in a peculiar way on ¢) and on wim — w. We return to this 
situation in Chap. 11 in connection with radiation processes. 


{ONIZATION OF A HYDROGEN ATOM 


As an example of the first-order time-dependent perturbation theory, we 
now calculate the probability of ionization of a hydrogen atom initially 
in its ground state when it is placed in a harmonically time-varying 
electric field. We might, for instance, think of the atom as being placed 
between the plates of a capacitor to which an alternating voltage is 
applied. This is, of course, not a realistic situation, since the circular 


1 Equation (35.14) together with its analog for w = 0, which will be discussed in Sec. 
37, is so useful that it was called ‘‘Golden Rule No. 2” by E. Fermi, ‘‘Nuclear Physics,”’ 
p. 142 (University of Chicago Press, Chicago, 1950). 
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frequency w/2x must exceed yet/4rh? = 3.6 X 10'® cycles/sec in order 
for ionization to occur. However, the electric field produced by the 
capacitor can be regarded as a model for the field associated with a travel- 
ing electromagnetic wave that lies in the ultraviolet part of the spectrum. 

The perturbation H’ is given by Eq. (31.26), where in accordance 
with (35.10) we write 


E(t) = 2Eo sin wt (35.15) 


The initial ground state of the hydrogen atom is obtained from Eqs. 
(16.24) with Z = 1: 


Un = Uroo(L) = (wao?)~he7/%0 (35.16) 


The final states should correspond to the motion of a positive-energy 
electron in the coulomb field of the proton (scattering states). However, 
as suggested by the discussion in Sec. 21, these wave functions are rather 
complicated, and it is instructive to consider only the much simpler 
approximate situation in which the coulomb interaction is neglected after 
ionization has occurred. Then the final states are free-particle momen- 
tum eigenfunctions (plane waves). It is convenient to use box normal- 
ization, so that 


u, = L- exp (ik-r) (35.17) 


where fk is the momentum of the ejected electron. 


DENSITY OF FINAL STATES 


The density of final states can be found from the permitted values of k 
in a box: k, = 2rn,/L, ete., where the n’s are positive or negative integers 
or zero. Thus there are (L/27)? dk, dk, dk, states in the range dk, dk, dk, 
of the propagation vector. Energy conservation gives E, = En, + he, 
or A2k?/2u = —pe*/2h? + hw, so that the magnitude of k is fixed. The 
matrix element that appears in (35.14) will, however, depend on the 
direction of k, and so it is reasonable to define groups of final states k; 
each of which corresponds to an infinitesimal range of directions for the 
motion of the ejected electron. We therefore express the range of the 
propagation vector, which is given above in rectangular coordinates, 
in terms of spherical coordinates as k* dk sin @d6 dq, where 6, ¢ are 
the polar angles of k with respect to some direction which we take 
for convenience to be that of the electric field. Then p(k) dE, is equal 
to (L/2:r)*k? dk sin 6 dé d¢, where dE, and dk are related by differentiation 
of the equation E, = f?k?/2u. We thus obtain 


BL |. 
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1ONIZATION PROBABILITY 


The matrix element that is to be inserted into Eq. (38, 14) is obtained 
from Eqs. (31.26), (35.10), and (35.15) and the initial oe final wave 
functions (35.16) and (35.17). The result is 


(k|H’|m) = eEo(wao*L*)—3f eres ’r cos 8e-1/% d3r (35.19) 


where 6’ is the angle between r and k, and 6” is the angle between r and 
the electric field. As remarked in the preceding subsection, we use 6 to 
denote the angle between k and the electric field. The integration over 
the direction of the vector r may be performed either using k as the polar 
axis (in which case 6’’ must be expressed i in terms of 6’ and @) or using the 
electric field direction as the polar axis (in which case 6’ must be expressed 
in terms of 6’’ and 6). It is simpler to adopt the first procedure and put 


cos 6’ = cos 6’ cos 6 + sin 6’ sin 6 cos (¢’ — x) 


Here 6’, ¢’ are the polar angles of r with respect to k as polar axis, and 
6, x are the polar angles of the electric field direction with respect to k, 
as in Fig. 32. 


x 


Fig.32 Diagram for the integration of the matrix element (35.19). 


288 QUANTUM MECHANICS 


We can now replace dr by r? dr d(cos 6’) d¢’ in Eq. (35.19). The ¢’ 
integration causes the second term in the above expression for cos 6”’ to 
vanish. The rest of the evaluation of the matrix element is straight- 
forward and leads to 


= 32mrieEokay® cos 6 
(wao®L*)*(1 + kao”)? 


Substitution of this matrix element, together with the expression (35.18) 

for p(k), into (35.14) gives for the probability per unit time that the 

electron of the hydrogen atom is ejected into the solid angle sin 6d6d¢@ 
256uk3e7E 92a? 


<<) ee ee 2 i 
w rh + ag) 6 °O8 6 sin 6 dé d¢ (35.20) 


(k|H'|m) = 


The differential ionization probability per unit solid angle, 
w/(sin 6d@d@), is proportional to cos? 6. This is not surprising, since 
the probability amplitude is expected to be proportional to the component 
of the driving force in the direction of ejection, which is eEy) cos 6. The 
dependence of w on w is given most reliably when w is large, since then k 
is also large, and the modification of the plane-wave final state by the 
coulomb field of the proton is expected to be least important. This may 
be seen from Eq. (21.13), which shows that, for large k or small n, |u-(0)|? 
approaches 1/v, which is the probability density at r = 0 for a plane 
wave that is normalized to unit flux. 


SECOND-ORDER PERTURBATION 


It is a straightforward matter to substitute an expression like (35.11) 
into the right side of the second of Eqs. (35.8) and integrate the resulting 
equation for d,®. Without going through the rather lengthy details of 
this calculation, it is not difficult to see that the second-order amplitude 
contains terms that are appreciable when wim is close to 2w, —2w, and 0. 
Terms of this type are physically plausible, since they correspond respec- 
tively to the absorption of two Planck quanta fw by the system, the 
emission of two quanta, and the absorption and emission of one quantum. 
Such two-quantum processes are expected from a second-order calcula- 
tion in which the perturbation acts twice, in analogy with the one-quan- 
tum processes obtained as the result of the first-order calculation. 
Unfortunately, however, the second-order amplitude also contains 
terms that are appreciable for other values of wim. A careful analysis 
shows that these unphysical terms arise from the earlier assumption that 
the perturbation (35.10) is turned on suddenly at t = 0. The resulting 
discontinuity in the time derivative of the perturbation introduces Fourier 
components that are responsible for the secondary peaks in Fig. 31. In 
reality the perturbation will usually be turned on gradually during a time 
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interval that will be much longer than 1/w, so that the breadth of the 
Fourier transform of the perturbation, which has a maximum at », will 
be small in comparison with w. Also, this time interval will be much 
shorter than the total time é that the perturbation is on, in order that 
to can be specified with reasonable accuracy. The sudden turning on of 
the perturbation does not cause serious difficulties in first order, since we 
were able to ignore the secondary peaks in Fig. 31 even though their 
heights fall off only as 1/(wsm — w)?. In second order, however, the 
effects are more marked and cannot be ignored; it is necessary then to 
turn the perturbation on slowly, and this complicates the calculation. 

We shall not pursue this further here, but the problem will arise 
again in Sec. 37 in connection with collision theory. The case in which 
a sudden change in the hamiltonian of a system is an essential part of 
the physical situation, rather than merely a mathematical convenience 
as in the derivation of (35.11), is discussed toward the end of this section. 


ADIABATIC APPROXIMATION! 


It was remarked at the beginning of this section that methods can be 
developed for treating systems in which the time dependence of the 
hamiltonian is small, slowly varying, or rapidly varying; these correspond 
to the perturbation, adiabatic, and sudden approximations, respectively. 
In the adiabatic case, we expect on physical grounds that solutions of 
the Schrédinger equation can be approximated by means of stationary 
eigenfunctions of the instantaneous hamiltonian, so that a particular 
eigenfunction at one time goes over continuously into the corresponding 
eigenfunction at a later time. In other words, if the equation 


H(t)ua(t) = En()un(é) (35.21) 


can be solved at each instant of time, we expect that a system that is in | 
a discrete nondegenerate state un(O) with energy E,,(0) at t = 0 is likely 
to be in the state u,,(t) with energy E,,(t) at time t, provided that H(t) 
changes very slowly with time. Our objective is to estimate the extent 
to which this expectation is not fulfilled, so that other states appear in 
the expansion of y in terms of the w’s. 

The wave function y satisfies the time-dependent Schrédinger 
equation 


nS? = Hy (35.22) 

We proceed by expanding y in terms of the w’s in the following way: 
= é / , 

Y= ) an(tun(t) exp | Gh)! ff Bale’) at’ | (35.23) 


1M. Born and V. Fock, Z. Phys. 61, 165 (1928). 
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where we assume that the u, are orthonormal, discrete, and nondegen- 
erate. Substitution into Eq. (35.22) gives 


2 | dat + Gn | exp | ca ih E,(t’) ar | ai 


where use has been made of (35.21). We multiply through on the left 
by uy and integrate over all space to obtain 


dy = — Y anck|n) exp | (at) [* (BE, — Ex) at’ | 
(kn) = fut Se ae (35.24) 
An expression for (k|”%) can be found by differentiating Eq. (35.21) with 


respect to the time: 


se Onin _ 9En OUn 
a + H — “Ot Un + E,, “Ot 


We multiply through on the left by uf, where k ~ n, integrate over all 
space, and make use of Eqs. (23.21) and (35.21) to obtain 


(i a n) = (E,— Ekin) ok én (35.25) 


CHOICE OF PHASES 
We also require an expression for (n|7). Differentiation of the equation 
{n|n) = 1 with respect to time gives 

(n\n) + (n\n) = 0 


Since these two terms are complex conjugates of each other, each is purely 
imaginary, and we can write (n|n) = za(t), where a(t) is real. We now 
change the phase of u, by an amount y(t), which is permissible since the 
phases of the eigenfunctions are arbitrary at each instant of time. For 
the new eigenfunction u, = une”, 


{n'|n’) = (nln) + ty = ta + ¥) 
Thus the choice y(t) = — if a(t’) dt’ for the phase makes (n’|7’) = 0. 


In what follows, we assume that all phases have been chosen in this way 
and omit the primes. 
Substitution of sa 25) into (35.24) then gives 


a, = DK i (k oe n) exp lé f * conn (t’) ar | (35.26) 
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where the prime on the summation indicates that the term = k is 
omitted. The group of Eqs. (35.26) for all k is exactly equivalent to 
the Schrédinger equation (35.22). 

We can now estimate d, by assuming that all quantities (dn, win, 
Un, OH /dt) that appear on the right side of (35.26), and which are expected 
to be slowly varying, are actually constant in time. If, further, we 
assume that the system is in the state m att = 0, we can put a, = dam. 
We thus obtain 


dy = (hem)? (i 


oH 10 emt 
oa |) em k#Am 


which is readily integrated to give 
aH 
ot 
With the above approximations, Eq. (35.27) shows that the prob- 
ability amplitude for a state other than the initial state oscillates in time 
and shows no steady increase over long periods of time even though H 
changes by a finite amount. The magnitude of this amplitude after a 
long time is of the order of the ratio of the change in H during the Bohr 
period 21/wim to the energy difference E, — E, between the states. 


ay(t) = (thom?) ¢ 


n) (cient — 1) km (35.27) 


CONNECTION WITH PERTURBATION THEORY 


An exceptional situation arises when the hamiltonian oscillates in time 
with a frequency nearly equal to one of the transition frequencies, say 
wpm. ‘This is a case of resonance, and we expect from the discussion of 
time-dependent perturbation theory earlier in this section that even a 
very small change in H can produce an appreciable amplitude a, over 
long periods of time, so that (35.27) cannot be valid. The reason for 
the failure is that it is then no longer permissible to assume that the time 
dependence of dH /dt can be neglected, so that the passage from (35.26) 
to (35.27) is not justified. 

In order to consider this case in a way that permits comparison with 
the results of perturbation theory, we assume that only a small part of 
H oscillates in time with an angular frequency w that is close to |wem|: 

: oH : 

H = Hy, + 2H’ sin wt ap 2wH’ cos wt 
where H’ is small compared with Hy and both are constant in time. If 
the dependence of an, win, and u, on time is neglected and we put dn = Sam 
as before, Eq. (35.26) becomes 

w 


fsa OS. ’ ier tot L. giWrm—w)t 
a (k|H |m)(e + etm) 
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which is readily integrated to give 
oie ee) 


thoem Wkm + w km — 


a(t) ~ (35.28) 

Equation (35.28) shows first that the adiabatic approximation 
breaks down for wim ~ +, since then (35.28) increases steadily with 
the time. It also shows that the perturbation result (35.11) is repro- 
duced. If wim is close to +w, the first term in the parentheses can be 
neglected and we can replace w/wim outside the parentheses by +1; if 
wm 18 close to —w, the second term in the parentheses can be neglected 
and we can replace w/wm by —1. 


DISCONTINUOUS CHANGE IN 


As an introduction to the sudden approximation, we consider first a 
situation in which the hamiltonian changes discontinuously from one 
form that is constant in time to another. Suppose that H = Ho for 
t<0Oand H = H, fort > 0, where 


Hottn = EnuUn Ay, a wp 


and the u’s and v’s are complete orthonormal sets of functions that are 
not necessarily discrete. The general solutions can be written 

Y = SpGnine tnt! t<0 

y = Sb yee atl t> 0 
where the a’s and b’s are independent of the time. 

Since the wave equation (35.22) is of first order in the time, the 
wave function at each point in space must be a continuous function of 
the time at t = 0, although its time derivative is not. The b’s are then 
readily expressed in terms of the a’s by equating the two solutions (35.29) 
at ¢ = 0, multiplying by a particular v*, and integrating over the 
coordinates: 

by = Sndn(uln) (35.30) 
If the system is initially in the state m, so that a, = (n|m), Eq. (35.30) 
tells us that 6, = (ujm), and final states appear that do not have the 
same energy as the initial state. This is a consequence of the nonzero 
frequency Fourier components into which the suddenly changing hamil- 
tonian can be resolved. 


(35.29) 


SUDDEN APPROXIMATION! 

The sudden approximation consists in using Eq. (35.30) when the change 
in the hamiltonian occupies a very short but finite interval of time fp. 
In order to make an estimate of the error introduced in this way, we 
1'W. Pauli, Handbuch d. Physik 24, part 1, 164 (1933). 
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consider a problem that, although somewhat artificial, can easily be 
solved formally. Suppose that H = Hyofort < 0,H = Ay fort > to, and 
H = H,for0 < t < t. The intermediate hamiltonian, which is assuméd 
to be constant in time, has a complete set of energy eigenfunctions: 


Hw, = Ew, 


The true solution can be expanded in terms of the w’s with constant 
coefficients: 


y = SOx Wee Eat! 0 <t< to 


The continuity condition at t = 0 gives & = S,dn(x|n), and the 
continuity condition at ¢ = to then gives 


by = Se6( ue] x) Bax E ud tol ® 
= Sn da{ul ape Ee-Bn)te!% «| 0) (35.31) 
When to = 0, the exponential is equal to unity, and an equation like 
(23.9’) shows that b, is given by (35.30), as it should be. 


The sudden approximation is evidently best when ¢) is small. The 
exponential in (35.31) can then be expanded to give 


by ~ SaauSsule) [1 — 58, — B,) | deny 
which can be rewritten as 
by = Snantu| E = - (A; - -H)| |”) (35.32) 


by making use of the formalism of Sec. 23. Thus the error in the sudden 
approximation is proportional to t) for small tp and can be estimated in 
simple cases from Eq. (35.32). This equation can be generalized to a 
situation in which H; depends on the time; in this case a result that is 


also correct to first order in tp is obtained by replacing Hto by f : H, dt. 
An interesting special case of Eq. (35.32) is that in which the initial 


and final hamiltonians are the same (H; = Ho) and the system is initially 
in a particular state m. We then obtain 


be = Sem — Ho (k|(He — H)|m) 


This can be used even when H; — Hp is not small in comparison with Ho, 
provided that ¢o is sufficiently small. On the other hand, the perturba- 
tion theory given earlier in this section is useful when H; — Ho < Ho 
and to is large. 
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DISTURBANCE OF AN OSCILLATOR 


As a simple example of the application of the adiabatic and sudden 
approximations, we consider a linear harmonic oscillator in which the 
position of the equilibrium point a(é) depends on the time. The hamil- 
tonian for this system is 


H Wo K ‘ 
() = ~ om age $K[x — a(d)] 

The instantaneous energy eigenfunctions are the harmonic-oscillator wave 
functions (13.13) centered at the point a(é), and the energy levels are 


unchanged: 
Un(z) = N,H,[a(z — a) ]e~ta*(e—2)* E, = (n+ 3)ho. 


We suppose first that the equilibrium point moves slowly, and we 
investigate the circumstances under which the adiabatic approximation 
is applicable. If the oscillator is initially in its ground state (n = 0), 
the time derivative of the hamiltonian dH /dt = — K(x — a)é has a non- 
vanishing matrix element only with the first excited state. With the 
help of (13.18) this is found to be 


(= — B= (tm 


Substitution into (35.27) shows that the coefficient of the time-dependent 
factor in the amplitude of the first excited state has the magnitude 


Ka (4/2)! 
hu? (Km)t  (2hiw,/m)3 


Equation (35.33) may be interpreted physically by noting that the denom- 
inator is of the order of the maximum spéed of a hypothetical classical 
oscillator that has the zero-point energy. Thus the adiabatic approxi- 
mation is good if the equilibrium point moves slowly in comparison with 
the classical-oscillator speed. It is not difficult to see that, for the nth 
excited state, the corresponding condition is that the equilibrium-point 
speed be small in comparison with 1/n times the corresponding classical- 
oscillator speed. 

Application of the sudden approximation to an oscillator in its 
ground state gives for the probability amplitude of the nth state after 
displacement of the equilibrium point from zt = 0 toz = a 


(35.33) 


at pe 


t fe 
S [Oo ut @ — aete de = Sf" ut (jeter de (35.84) 


This integral is identical with the expression for A, in Eq. (13.21), except 
for the sign of a, and has already been evaluated with the help of the 
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generating function (13.10) for the Hermite polynomials. The earlier 
discussion (Sec. 13) shows that the states most likely to be excited are 
those that have a classical amplitude of oscillation that is of the order 
of the displacement a; this is in agreement with the corresponding 
classical result. Equation (35.32) can then be used to show that the 
sudden approximation is valid in this case if the time required to move 
the equilibrium point is small in comparison with 1/no times the classical- 
oscillator period, where mo is the quantum number of the state most 
likely to be excited. 


PROBLEMS 


1. A one-dimensional harmonic oscillator of charge e is perturbed by an electric field 
of strength E in the positive z direction. Calculate the change in each energy level to 
second order in the perturbation, and calculate the induced electric dipole moment. 
Show that this problem can be solved exactly, and compare the result with the per- 
turbation approximation. Repeat the calculation for a three-dimensional isotropic 
oscillator. Show that, if the polarizability a of the oscillator is defined as the ratio of 
the induced electric dipole moment to E, the change in energy is exactly —dea?. 

2. A one-dimensional harmonic oscillator is perturbed by an extra potential energy 
bz?. Calculate the change in each energy level to second order in the perturbation. 
3. Find the first-order Stark effect for a hydrogen atom in the state n = 3. Sketch 
the arrangement of the levels and state the quantum numbers associated with each. 


4. A system that has three unperturbed states can be represented by the perturbed 
hamiltonian matrix 


E, 0 a 
Oo EF, b 
a* b* E2 


where E, > E,. The quantities a and } are to be regarded as perturbations that are 
of the same order and are small compared with E, — Ei. Use the second-order non- 
degenerate perturbation theory to calculate the perturbed eigenvalues (is this pro- 
cedure correct?). Then diagonalize the matrix to find the exact eigenvalues. Finally, 
use the second-order degenerate perturbation theory. Compare the three results 
obtained. 

5. A trial function y differs from an eigenfunction uz by a small amount, so that 
vy = uz + a1, where ug and y; are normalized and e <1. Show that (H) differs 
from E only by a term of order e?, and find this term. 

6. If the first n — 1 eigenfunctions of a particular hamiltonian are known, write a 
formal expression for a variation-method trial function that could be used to get an 
upper limit on the nth energy level. 

7. Find the next terms (of order R~‘) in the expansion of Eq. (32.12). Show that 
their diagonal matrix element for the unperturbed ground state vanishes, so that there 
is no inverse fourth-power contribution to the van der Waals interaction. 

8 Use the first nonvanishing term in the series (32.13) to get a lower limit for 
—W(R). Compare with that obtained from the variation calculation. 
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9. Use the combination of perturbation and variation methods employed in Sec. 32 
in connection with the van der Waals interaction to obtain limits on the electric 
polarizability of a hydrogen atom in its ground state. 

10. A particle of mass m is bound by the potential V(r) = —Voe-"/*, where 
h?/mV oa? = 2. Use the variation method with the trial function e-7 to get a good 
limit on the lowest energy eigenvalue. 


11. A hydrogen atom is placed in a uniform electric field of strength E. Use Eq. 
(33.6) to calculate the total electric field at the nucleus. Give a qualitative discussion 
of your answer in physical terms. 

12. Make use of Eq. (33.6) to calculate the electric dipole moment induced-in a 
hydrogen atom by a uniform electric field, and then find the polarizability. Show 
that the polarizability, electric field strength, and energy change are related by Eq. 
(33.8). 

13. A hydrogen atom is placed in an electrostatic potential that has quadrupole 
symmetry. Make use of Eq. (33.21) to calculate the ratio of the strength of the total 
quadrupole potential in the infinitesimal neighborhood of the nucleus to the strength 
of the externally applied quadrupole potential. 


14. Denote y, in Eq. (31.2) by |s). Show from Eqs. (31.4) and (31.6) that 


lH’) = YD) Wereals — alr — p) 


q=0 p=0 


Use this formula to derive Eq. (33.17). (D. H. Smith, private communication.) 


. 15. Calculate the leading term in the third-order perturbed energy of a hydrogen 
atom and a point charge Ze at a fixed distance R, for large 


16. Show that the WKB approximation gives the correct energy eigenvalues for all 
states of the harmonic oscillator. 


17. Apply the WKB method to the one-dimensional motion of a particle of mass m 
in a potential that equals —V> at x = 0, changes linearly with z until it vanishes at 
x = +a, and is zero for |z| >a. Find all the bound energy levels obtained in this 
approximation if mV,a?/A? = 40. 

18. Use the WKB approximation to show that an attractive three-dimensional poten- 
tial that falls off like r-* for large r has an infinite number of bound states if n < 2. 

19. A hydrogen atom in its ground state is placed between the plates of a capacitor. 
A voltage pulse is applied to the capacitor so as to produce a homogeneous electric 
field that has the time dependence E = 0 for é < 0, E = Eve’ fort > 0. Find the 
first-order probability that the atom is in the 2S state (200) after along time. What 
is the corresponding probability that it is in each of the 2P states? 

20. Make use of Eq. (35.20) to calculate the total cross section for photoionization 
of a hydrogen atom by a photon of energy fw. Show that the photon wavelength is 
large in comparison with the size of the atom if the photon energy is not much above 
threshold, so that the assumption of a uniform electric field made in deriving (35.20) 
is appropriate to the photoionization problem. Show also that the effect of the mag- 
netic field of the light wave can be neglected in comparison with that of the electric 
field. 


21. Explain why Eq. (35.32) does not contain Ho and H;, in a symmetrical manner. 
t 
Show also that Hf) can be replaced by I : H,(t) dt if H; depends on ¢, but only to 


first order. 
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22. Show that the remark below Eq. (35.33) about the excitation of the nth state in 
the adiabatic approximation is correct. 


23. Show that the remark below Eq. (35.34) about the validity of the sudden approx- 
imation is correct. : , 


24. A hydrogen nucleus of mass 3 is radioactive and changes into a helium nucleus of 
mass 3 with the emission of an electron that has about 17,000 ev maximum energy. 
Show that the sudden approximation can be applied to the extranuclear electron 
initially present in the hydrogen atom and is superior to other approximation methods 
that might be used. Calculate the numerical values of the probabilities that the 
resulting helium ion is found in its 18, 28, and 2P states if the hydrogen atom is 
initially in its 1S state. Give a qualitative discussion of the energy balance in this 
process. 


9 
Approximation Methods 
in Collision Theory 


As remarked at the beginning of Chap. 8, there are so few systems of 
physical interest for which exact solutions can be found that approxima- 
tion methods play an important part. in applications of the theory. 
Various methods that are useful in scattering problems are considered 
in this chapter. We start with the development of the scattering matrix; 
this provides a more general framework, in terms of which collision prob- 
lems of all kinds may be discussed, than that of Chap. 5. Although not 
an approximation method in itself, it is a convenient starting point for 
perturbation and other approximations, some of which are considered 
later in this chapter. 


36LITHE SCATTERING MATRIX 


The collision theory developed in Chap. 5 assumed that the scattering 
potential energy is independent of the time and hence was able to make 
use of energy eigenfunctions. Equation (18.10) describes a situation in 
which the incident flux is represented by a plane wave of infinite cross 
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section. As pointed out in connection with Fig. 21, this is not realistic 
since the incident beam is always collimated. However, collimation can 
be taken into account by superposing infinite plane waves that travel 
in slightly different directions; their angular spread in direction, measured 
in radians, is of the order of the ratio of the particle wavelength to the 
diameter of the collimating aperture. This angle is so small in practical 
cases that it does not affect the calculation of the scattering amplitude. 
Another way of describing the situation represented by Fig. 21 is to say 
that the propagation vector k has a definite magnitude k given in terms 
of the energy by E = h?k?/2u but that its transverse components k, and 
ky are very small in comparison with k; then k, is practically equal to 
k, and the ‘wave packet” is well defined only in the two transverse 
directions. Since only a single energy HE appears, the time-independent 
Schrédinger equation can be used. 

It will also happen that the time-independent model of a constant 
incident flux and a steady outgoing scattered wave is not appropriate in 
some circumstances. It certainly cannot be correct when the scattering 
potential energy V depends on the time, either intrinsically or because 
it may be convenient to think of V as being “‘turned on” at one time and 
“turned off” at a later time. Even when V is constant in time, it may 
be desirable to replace the incident plane wave with a moving wave 
packet that is rather well defined in all three spatial directions and gives 
rise to an outgoing spherical pulse as it passes over the scattering potential. 
Such a wave packet can be constructed by superposing time-dependent 
plane waves, in a manner similar to that discussed in the one-dimensional 
case at the end of Secs. 12 and 17. It then follows from Eq. (12.18) that 
the spread in each of the three components of k is of order 1/R, where R 
is the size of the wave packet. The corresponding spread in the energy 
is of order E/kR; although this is very small compared with E in practical 
cases, the time-dependent Schrédinger equation must still be used.’ 

We start by deriving integral equations for the wave function and 
for the closely related Green’s functions, and we then define the scattering 
matrix. This matrix describes the asymptotic behavior of the system in 
the sense that the spatial separation of the interacting parts is so large 
that interaction can be neglected, or the times are such that interaction 
has not yet occurred or is no longer occurring. Thus the scattering 
matrix describes the observable aspects of the scattering process. Since 
it is obtained by integrating the wave function through the space-time 
domain of interaction, it contains whatever information resides in the 
interaction that is relevant to scattering. At the same time, it shows 


1 For a fuller account of the wave-packet approach, see M. L. Goldberger and K. M. 
Watson, “Collision Theory,” chap. 3 (Wiley, New York, 1964); R. G. Newton, 
“Scattering Theory of Waves and Particles,” chap. 6 (McGraw-Hill, New York, 1966). 
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what limitations are placed on possible scattering processes by the fact 
that the hamiltonian is physically meaningful or has certain symmetry 
properties. When derived from the Schrédinger equation, the scattering 
matrix evidently contains no more information than the hamiltonian. It 
can, however, also give useful results when the interactions that appear 
in the hamiltonian are not definitely known, as is the case in nuclear 
physics. 


GREEN’S FUNCTIONS AND PROPAGATOR 


Even though we shall not be working explicitly with moving wave 
packets, it is desirable to retain the generality inherent in the time- 
dependent situation for as long as possible. We therefore start with 
the full Schrédinger equation 


AS VG) = HYG!) = (Ho+ VOOM) m= -hy 


which describes the relative motion of two particles with reduced mass p. 
Since Eq. (36.1) gives the first time derivative of y in terms of y itself 
and since higher time derivatives do not appear in the equation, the 
values of y for all r and one particular time ¢ suffice to determine y for 
all r and all ¢ (both earlier and later). Also, since the wave equation is 
linear in y, solutions can be superposed and the relation between y at 
different times.must be linear. This means that y must satisfy a homo- 
geneous integral equation of the form 


v(r',t’) = 1G’, x,t) (r,t) dr (36.2) 


where the integration is over all space. This equation also serves to 
define G, which is called the Green’s function that corresponds to the 
hamiltonian H. 

Equation (36.2) does not make a distinction between forward prop- 
agation of y in time (é’ > t), and backward propagation (t’ < t). It is 
sometimes convenient to have a clear separation between these two 
cases. For forward propagation we define the retarded Green’s function 
or propagator: 


Gtr’ r,t) = Gr’, r,t) fori’ >t 


(36.3) 
= 0 fort’ <t 


We also introduce the unit step function 6(r) defined by 


O(r) = 1 forr > 0 6(r) = 0 forr <0 (36.4) 


4 
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Then the equation 
ot! — thy(r’,t’) = tfGt(r’,U r,t) (r,t) dr (36.5) 


is the trivial identity 0 = 0 for t’ < t and is the same as (36.2) for ¢’ > t. 
For backward propagation in time we can anual y: define the 
advanced Green’s function: 


G-(r’,t’ x,t) = —Gr',t’sr,t) fort’ <t 


= 0 fort’ >t ee) 


The equation 
at — Uv’) = —1fG-(e' trv!) dr , (36.7) 


is again an identity for t’ > ¢ and the same as (386.2) for t’ < t. Thus 
Eqs. (36.5) and (36.7) together are equivalent to Eq. (36.2).: 

The following four integral relations involving G+ can be obtained 
from Eqs. (36.5) and (36.7) by making use of the fact that y can be 
chosen arbitrarily at any instant of time (see Prob. 1): 


Gt(r’,t'sr,t) = 7fGt(r’,t ri,t)Gt (r1,t:31,t) dr ift’ >t>t 
G(r’, r,t) = —tfG-(2’,t st1,t)G (r1,t157,t) dry if <t<t 
S(r — 2’) = ft(r’ tr,t)G-(titir,t) dr ift > t 
(rr — r’) = fG-(r’,t3r1,t)G+(11,0137,0) dri ift<t 

(36.8) 


FREE-PARTICLE GREEN’S FUNCTIONS 


An explicit expression for the Green’s function defined by Eq. (86.2) is 
readily found for the case in which V = 0. We denote it by Gp; it is 
proportional to the bracket term in Eq. (10.19) in the free-particle case 
and may be evaluated in analogy with the solution of Prob. 8, Chap: 3. 
The result is 


Go(r',t' x,t) = 3 lost 9 | exp et i (36.9) 
The corresponding retarded Green’s function, or free-particle propagator, 
is obtained from Eq. (36.3): 

Got(r’,t’ x,t) = a(t! — t)Go(r’,t’ r,t) (36.10) 
The advanced Green’s function is similarly obtained from Eq. (36.6): 


on (r’,t!r,f) = —0(t — t’)Go(r’,t’ x,t) . (36.11) 
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The validity of Eqs. (36.8) in the free-particle case can then be verified 
by direct substitution. 

It can also be seen from Eqs. (36.9) to (36.11) that there is a com- 
plex-conjugate relationship between Got and Go: 


Got (r',t's3,t) = Go-*(,t 7,0’) (36.12) 


We shall see below that this relationship also holds for G+ provided that 
V is real. 


INTEGRAL EQUATION FOR 


We now obtain an iterated form of the integral equation (36.5) for y.! 
We imagine that V is turned off except for a number of very short 
intervals of time between ¢ and @’: from ¢; to t; + At, from tz to te + Afe, 
etc., and finally from ft, to t, + Atn, where ! >t, > ++: >t >t. 
Then Got may be used as the propagator from each ¢; + Af; to tip1: 


O(iza — ti — Ats)W(tizs, bizs) 
= {Got Winn bini5 Ty ts + AL)W(i, t: + Ati) d*r; (36.13) 


Propagation from t; to é; + At; involves both Hy and V. Since At; 
will later be regarded as an infinitesimal, the two changes in y may be 
added. The Hy part involves Got and may be taken in with (36.13). 
The V part of the change in y is easily obtained from (36.1) and is 
—(t/h) V (t,,ti)¥(t.,t:) At,; this should be followed by free-particle propaga- 
tion from ¢; + Aé; to ti41 in accordance with (36.13), but the At; in the 
argument of Got can be omitted since we are interested only in quantities 
of first order in At; We thus obtain 


O(tiza — b)b(ips, bi41) 
=1 / Got (fiz1,tig2 Lite) E =. : V (1;,¢:) as | W (tits) d®r; (36.14) 
The resultant propagation from ¢ to v’ then gives 
a(t’ — Dy’) = / co fot’ t ajte) E — ; V (tn, tn) at, 
1Go* (Ln tn Ta—1yln—v) ofr 1Go* (Lo, te 3 1,t1) E = ; V (r1,01) ats | 
1G o*(r1,t157,00(r,t) d’rn + > + d'ry dr 
This may be multiplied out and simplified with the help of the first 


1R. P. Feynman, Phys. Rev. 76, 749 (1949); J. D. Bjorken and 8. D. Drell, ‘“Relati- 
vistic Quantum Mechanics,” secs. 6.2, 6.3 (McGraw-Hill, New York, 1964). 
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of Eqs. (36.8): 
ace’ — dur’) = 6 f Gote'tx)¥G,t) dr 
ae i ») i: J Got (r’,t! tot) V (tt AbGot (ratist,t)w(r,t) d2r; dr 


a , 
ec 2 [ff Gore rst) V Gat) ALGot Catestst) 
V (r5,t) A4G@ot (13,4; 37, Ov) dr; ar; i le eared, 


where ¢; > ¢; in the third term, etc. If now the time intervals At; during 
which V is turned on become more frequent and of shorter duration until 
V is on continuously, the summations may be replaced by integrations: 


Ot — Ove) = 6 f Gore x, OY(ED dr 
aa ; if dt; if J Got (r,t! tts) V (tt) Got Kater, )V(r,t) dr; dr 
a 5 f dt; f dt; i i} Got (n't rita) V (tits) @ot (ri, ti 305,t5) 


V (t4,t;)@ot (x,t; 5x, Dv (r,0) dr; dr; dr + eee (36.15) 


Note that all integrations over times may be taken from ¢ to @’; it is not 
necessary to require that ¢; > ¢; in the third term, for example, since 
Go* (r,,t;;r;,;) vanishes unless this is so. 


(a) a (b) 


Fig. 33 Schematic representation of the first term (a) and second term (6) 
on the right side of Eq. (36.15). 
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It is natural to interpret Eq. (36.15) as a perturbation series, the 
first two terms of which are indicated schematically in Fig. 33. The 
first term on the right side represents the amplitude corresponding to 
free propagation from ¢ to t’ (Fig. 33a), the second the sum of all single 
interactions at ¢; with free propagation from ¢ to ¢; and from ¢; to ¢’ (Fig. 
33b), etc. We rewrite Eq. (36.15) for convenience as 


a(t! — duce’) = tf Got rv (a,t) ar 
+5 | dts / Got (1',t' stints) V (rast: E J Got (r.,é:;2,0)v(1,t) dr 
a : J dt; ‘ Got (tatistit;) V (t,t) Got (ty, t;0,O0(1,0) dr; dr 
+ PP 38 | ar 


If the series in brackets is assumed to converge, then in accordance with 
Eq. (36.15) it is equal to @(t; — )y(ri,t:) or simply to P(r, t,) since t; > ¢. 
We thus obtain an integral equation for y that, unlike (36.5), involves 
only Got and not Gt: 


a(t’ — ty(r',t’) = i [ Got (r’,t’ s7,t)v(x,t) dr 
1 igh 
+5 i dt; i Got (rt! tat) V (tt)¥(tit,) dr: (36.16) 
The integration over ¢,; is from t to ?’. 


INTEGRAL EQUATION FOR THE PROPAGATOR 
Comparison of Eqs. (36.5) and (36.15) shows that 
Gt (r',t’ sx, 0) = Got (r’,t’ x,t) | 


+5 ff Gore’ t tot) Vrat)Gor (enter) ds dr 


+ i / i] il Got(r’,t rite) V Gi t)Got (i,t: 30,5) 
V (x;,t;)@ot (x;,t);1,¢) dt; dr; dt; d'r; + - +--+ (36.17) 
Again, if this series is assumed to converge, it may be summed to give 
the equation 
Gtr’, 7,0 = Gott 7,0) 


4: ; [ Goto’ rst) V CstE"(stA!) dt drs (36.18) 


USE OF THE ADVANCED GREEN'S FUNCTION 


The results of the two preceding subsections can be cast in terms of 
advanced rather than retarded Green’s functions and are given here for 
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completeness even though they are not as useful as the propagator equa- 
tions obtained above. 

We now have that ¢t’ < ¢ and therefore we start by imagining that 
V is turned off except for the intervals from ¢, — At, to tn, from 
tn-1 — Ata-1 to tn-1, etc., and finally from ¢, — At, to t:, where 
t!<tp< +++ <t;<t. Then the basic equation (36.14) for forward 
propagation is replaced by the following equation for backward propaga- 
tion in time: . 


Ot; — teat) | 


= =i f Ge Girntastot) [1+ 5 Vo) Ot [HEH dre (6.19) 


As compared with (36.14), the minus sign in front of the right side of 
(36.19) arises from the minus sign in (36.6) or (36.7), and the plus sign 
in the brackets appears because the change in y is calculated from ¢; to 
t; = At; rather than from t; to ft; + Al;. , 

It is now apparent that Eqs. (36.15) and (36.16) can be rewritten 
correctly by making the following substitutions: 


a(t’ - t) — TG — t') Got(r’,t! r,t) = Go (r’,t! r,t) ghia 
(36.20) 


In similar fashion, Eqs. (36.17) and (36.18) may be rewritten correctly by 
making the substitutions: 


Got (r’,t’ 3x,t) > Go-(r’,t’ sx, t) Gt(r’,t’ x,t) > G-(r’,t’ 31,1) (36.21) 


In all the rewritten equations, all time integrations are from ?¢’ to t. 

Equation (36.17), which expresses G't+ in terms of Got, and the corre- 
sponding rewritten equation, which expresses G~ in terms of Go-, can now 
be used, with the help of (36.12), to show that 


Gt(r’,t'sx,t) = G-*(r,t;7’,t) (36.22) 
provided that V is real. 


DIFFERENTIAL EQUATION FOR THE GREEN’S FUNCTIONS 


Thus far we have used the differential equation (36.1) for y to construct 
the integral equations (36.2), (36.15), and (36.16) for y, and the corre- 
sponding integral equations (36.17) and (36.18) for G+. The Green’s 
functions also satisfy a differential equation analogous to (36.1); this 
equation is expected to be inhomogeneous since G+(r’,t’ ;r,t) is, in effect, 
the wave function at r’, ¢’ that is generated by a momentary point source 
at r, t. 

We can find this equation for G+ by operating on Eq. (36.5) with 
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i(0/at’) — (1/h)H’, where H’ is H expressed in terms of the primed varia- 
bles r’, t’. It is necessary to make use of the relation 


d 
| in 6(r) = &(r) (36.23) 
which follows from (86.4). The result is 
16(t’ — ty(r’,t’) = 1 J (ix — ;#’) Gt(r’,tsr,Qv(r,8) dr 


where use has been made of Eq. (36.1). This equation shows that the 
right side must be proportional to 6(t’ — ¢), so that only y(r,t’) can appear 
in the integrand of the right side. But since y(r,t’) can be an arbitrary 
function of r at any particular time ?’, the rest of the integrand of the 
right side must be proportional to 6°(r’ — r). We thus conclude that 

(= — +H’) Gt(r’,t’ r,t) = (r’ — r) ot! — t) (36.24) 

It is easily seen that, if in the preceding paragraph we had started 
with Eq. (36.7), we would have found that G~ also satisfies (36.24). Thus 
G+ and G~ satisfy the same inhomogeneous differential equation but of 
course differ in their boundary conditions. G+(r’,t’;r,t) is the solution of 
Eq. (36.24) that is zero for t/ < t, and G-(r’,t’;r,t) is the solution that is 
zero for t’ > t. 


SYMBOLIC RELATIONS 


The integral and differential equations for the Green’s functions, obtained 
in the three preceding subsections, can be written in a symbolic form that 
is instructive. Equation (36.17) and the corresponding equation for G— 
may be written 


Gt = Got + A GotVGo® + h-7GotVGotVGot + ° - (36.25) 
In similar fashion, we may write Eq. (36.18) in the form 
Gt = Got + h-'Go* VG (36.26) 


Again, the differential equation (36.24) and the corresponding equation 
for G- may be written 


ce ae oe 
(5-5 )e =1 (36.27) 


where the primes have been dropped. On omitting the V term in H, we 
obtain in place of Eq. (386.27) 


.d 1 | 
(: at = i Ht) Got =] | (36.28) 
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Various checks can be made on the consistency of the last four 
equations. If we symbolically multiply both sides of Eq. (36.26) from 
the left with 2(0/dt) — (1/h)Ho, we obtain 


.9 1 wihgeccl nm ae bi 
where use has been made of (36.28); the result agrees with (36.27). As 
another consistency check, Eq. (36.26) may be written as 


(1 — AG *V)G* = Got 
or as 

G* = (1 — A7'Go*V) Got (36.29) 
A formal expansion of the inverse operator into a power series gives 

Gt = (1 + AG EV + h-°GotVGotV + - + *)Got 


which is the same as (36.25). This successful use of an inverse operator 
suggests that Eqs. (36.27) and (36.28) may be written symbolically as 


~1 -1 
a+ = («3-5 8) and Get = (45, - 4%) 
(36.30) 
respectively. 

Although such symbolic manipulations are often of heuristic value 
in suggesting new relationships, they cannot always be regarded as deriva- 
tions or proofs of such relations. In particular, inverse operators of the 
type that appear in Eqs. (36.29) and (36.30) are often singular, and 
special care must be taken in the interpretation of the singularity. As 
an example of this, we note that both the retarded arid advanced Green’s 
functions are given by the same symbolic expressions in (36.30). We 
shall see in Sec. 37 that a suitable treatment of the singularity in the 
inverse operator serves to provide the boundary condition that dis- 
tinguishes between forward and backward propagation in time. 


APPLICATION TO SCATTERING 


The formalism developed thus far can now be applied to a scattering 
situation. We imagine that V is not effective in the remote past or the 
far future, so that H can be replaced by Ho for t < —Ti and t > T,, 
where 7’; and T, are large but finite times. This may be because V was 
turned on at one time and turned off at a later time or because the initial 
and final states consist of wave packets that lie outside of the scattering 
potential. We call the free-particle wave functions ¢,(r,t); they satisfy 
Eq. (36.1) with V set equal to zero. The ¢. constitute a complete set of 
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functions at each instant of time and are, for convenience, chosen to be 
orthonormal; it is shown in Prob. 3 that orthogonality is preserved in 
time. Associated with each ¢. is a wave function y,+(r,t) that grows out 
of ¢, starting from a time in the remote past (t < —T1) before V was 
effective. Since y,+ satisfies Eq. (36.1), it can be written, in accordance 
with Eq. (36.5), in the form 


vat (r’,t’) = ifGt(r’,t' x,t) d.(r,t) d?r t<—-T, (36.31) 


where 6(t’ — t) may now be omitted. This y.+ is independent of the 
precise value chosen for ¢ so long as it is less than — 7}. 

At some time in the far future (tf > T2), when V is again ineffective, 
Yat is a solution of the free-particle Schrodinger equation and hence must 
be expressible as a generalized summation over the sz wath constant 
coefficients. Then the quantity 


(B|S|a) = (¢8,Pat) t>T, (36.32) 


is the amplitude of the free-particle state 8 that is contained, after the 
scattering has taken place, in the state that grew out of what was the free- 
particle state a before the scattering took place. Again, it is independent 
of the precise value chosen for ¢’ so long as it is greater than T2. Equation 
(36.32) is the amplitude for the transition from a to 8 and defines an ele- 
ment of the scattering matrix or S matriz.} 

Substitution of (36.31) into (36.32) gives a more symmetrical form 
for the S matrix element: 


(8|S|a) oe tf fos (r’ i ls t,t) da(t,t) dr’ dér 
i< —-T; > T, (86.33) 


Other forms can be obtained by making use of the following equation, 
which can be obtained from the complex conjugate of the free-particle 
version of Eq. (36.7) together with (36.12): 


A(t — top (n't!) = 1fGo*(r't' x,t) bp (t,t) dr 
= fog (r,t)Gor(r,t;7’,t’) dr (36.34) 


Thus substitution of Gt from (86.17) into (36. ) and use of (36.5) and 
(36.34), lead to 


(B|Sla) = (Bla) -5 J Of (,1)V (0,1) be(z,t) at dr 


-4 [fff eRe nve nasa x.) VGNbale) dt! dr dt dr 
+++ (36.35) 


1J. A. Wheeler, Phys. Rev. 62, 1107 (1937); W. Heisenberg, Z. Naturforsch. 1, 608 
(1946); C. Mgller, Kgl. Danske Videnskab. Selskab, Mat.-Fys. Medd. 28, 1 (1948). 
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where (Bla) = fog (r,t)da(r,t) d*r and is independent of ¢. Similar substi- 
tution of Gt from Eq. (36.18) gives 


(B|S|a) = (Bla) — : / if o% (r,t) V (r,t)¥at(r,t) dt d?r (36.36) 


Each of the time integrations in Eqs. (36.35) and (36.36) extends at least 
from — 7, to 7, and thus includes the entire interval during which V is 
effective. 

Expressions for the S matrix element similar to the above can be 
obtained by using the advanced Green’s functions Go and G-. For 
example, associated with each ¢, there is a wave function y,~(1,t) that 
grows into ¢s at some time in the far future (¢ > T») after V has ceased to 
be effective. It may be written, by using Eq. (36.7) with primed and 
unprimed variables interchanged, in the form 


ve (r,t) = —tfG-(a,t;2’,t’!) det’, t’) dr’ > T, (36.37) 


We might then expect that the S matrix element for the transition from 
a to B is given by the inner product of ¥s~ and ¢. evaluated at a time in 
the remote past ({ < —T1): 


(Verida) = Uf JO-*(n,ty) OR (0,1) alt) dr! dr 

— t< Ty t’>T, (36.38) 
Equation (36.22) shows that (36.38) is indeed the same as (8|S]a) given by 
(36.33), provided that H is hermitian or V isreal. The discrepancy when 
V is complex is physically plausible, since we saw in Eq. (20.2) that an 
absorptive potential (imaginary part of V negative) causes the total 
probability associated with a state to decrease as time increases. This 
means that y.t(r’,t’) is generally smaller for ¢’ > T, than the ¢. out of 
which it grows, whereas y¥g~(r,t) is generally larger for < —T7, than the 
¢s into which it grows. Thus (36.38) is expected to be larger than (36.32) 
if V is absorptive. 


UNITARITY OF THE S MATRIX 


An important property of the S matrix is that it is unitary if the hamilton- 
ian is hermitian. In order to prove this, we must show that SSt = 1 and 
that StS = 1. Any of.the forms for (8|S|a) obtained above, or the corre- 
sponding expressions containing advanced Green’s functions or wave 
functions, can be used for this purpose. With the form (36.33), a typical 
matrix element of SSt is 


(8|SSt|a) = S,(8|Sly7|Stla) = S,(8|S|y)(o|S|y)* 
= SSfoar’ tte’, x,t) dy(r,t) d?r’ dr 
SSda(r’ t)Gt* (rt sr” Obs (r,t) d3r”’ dr’! 
t< -T, U>T, 
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It is convenient to use the same times ¢ and ?¢’ in the matrix elements of 
both S and St, and it is permissible since (36.33) is independent of these 
times so long as V is not effective. Then since the ¢’s are complete and 
orthonormal, the summation over y introduces 6°(r — r’’”’) in place of 
d,o,, and we have 


(B|SStla) = fifog@’ CGH’ r)G-Gtr"t)dalt”t!) dr! der d3r!’ 


where use has been made of Eq. (36.22). The third of Eqs. (36.8) then 
shows that the integral of the product of the two Green’s functions over 
d*r gives 53(r’ — r’’), so that the right side is simply (6|a). That StS = 1 
is proved in similar fashion.’ 


SYMMETRY PROPERTIES OF THE S MATRIX 


It is expected that the S matrix will possess symmetries that reflect the 
symmetries of the underlying hamiltonian. In order to see that this is 
the case, we consider first one of the symmetry operations discussed in 
Chap. 7 that can be represented by a unitary operator (space or time dis- 
placement, rotation, or space inversion). Any such operator U will 
transform a free-particle state $s into another state os = Uds that 
represents a possible free motion of the system, since U commutes with 
Hy. If in addition U commutes with H, it will transform y+ into a state 
Yat = Uy.t that also represents a possible motion of the system in the 
presence of the interaction V. Thus the S matrix element between trans- 
formed states, given by Eq. (36.32), is 


(B’|S|e’) (p",Wa'*) oa (Ud, Ua*) 
(¢p,UtUypa.t) a ($6,¥a*) me (B|S|e) (36.39) 
all evaluated for t’ > Ts. For example, the amplitude for scattering 


between any pair of states caused by a spherically symmetric potential is 
numerically equal to the amplitude for scattering between those states 


1 The proof given here for the unitarity of S is satisfactory since V is turned on and 
off at finite times in the past and future. The situation is more complicated if V is 
strictly constant in time, and it becomes ineffective in the remote past and far future 
because the initial and final states consist of wave packets that lie outside the range 
of V. One complicating factor is that H may possess bound states, and it can then 
be shown that the wave-packet states are orthogonal to these bound states and hence 
do not form a complete set. Nevertheless, S is still unitary if V is real, since energy 
conservation prevents the bound states from becoming occupied. For further dis- 
cussion, see C. Meller, loc. cit.; F. E. Low, in ‘‘Lecture Notes of Brandeis University 
1959 Summer Institute in Theoretical Physics’; W. Brenig and R. Haag, Fortschr. 
Physik 7, 183 (1959), Sec. 1, English translation published in M. Ross (ed.), ‘Quantum 
Scattering Theory,” (Indiana University Press, Bloomington, Ind., 1963); Goldberger 
and Watson, loc. cit.; Newton, loc. cit. 


APPROXIMATION METHODS IN COLLISION THEORY 311 


that are obtained by rotating the two members of the original pair in the 
same way. 

The left side of Eq. (36.39) may also be written (8|UtSU|a). Then 
since the kets |a) and |8) are arbitrary, we conclude that 


UtSU =S8 or [U,S] = 0 (36.40) 


Thus if U commutes with H, it also commutes with the operator S. 

For the antiunitary operation of time reversal, discussed in Sec. 29, 
the situation is somewhat more complicated. The time-reversal operator 
T transforms a free-particle state ¢, into another state that corresponds to 
reversed linear and angular momenta; we therefore denote it schematically 
by ¢-s. However, in accordance with the discussion of Eq. (29.12), 
operation with T also reverses the sense of progression of time, so that 


Top(t) = o-6(—4) (36.41) 


For example, if ¢s(t) = C exp z(kg - r — wet) for a spinless particle, then 
Eqs. (36.41) and (29.13) tell us that ¢s(t) = C* exp 1(—kg +r — wt), as 
expected. 

Suppose now that T commutes with H, so that the interacting 
system is time-reversal invariant. The simplest situation is that in which 
V is constant in time except for being turned on and off in a symmetrical 
manner att = +7 >. Then T will transform y,+ into a state that also 
represents a possible motion of the system. This state will not only have 
reversed linear and angular momenta but will also grow into ¢_. in the far 
future rather than growing out of ¢_. from the remote past; we therefore 
denote it by y_,. Again, operation with T reverses the sense of progres- 
sion of time, so that the analog of (36.41) is 


That (t) = ~.(—2) (36.42) 


For a system that is time-reversal invariant, so that H is hermitian 
or V is real, the discussion of Eq. (36.38) shows that an S matrix element 
can be written in the form 


(—a|S|—8) = ¥—(),¢-«)re- 1. = Wia(—8),6-0(—-D)io70 
We now substitute into this from (36.41) and (36.42) and make use of 
Eq. (29.138): 
(—alS|—8) = (That(t), Toa(t)) = (UK pat (t), UK ba(t)) 
= (Kyat (t),Koba(t)) = (b8,¥a*(d)) 
=(6|Sla) (36.43) 


all evaluated fort > To. Equation (36.43) shows that the amplitude for 
scattering from an initial state a to a final state 8 is numerically equal to 


. 
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the amplitude for scattering from the momentum-reversed final state —8 
to the momentum-reversed initial state —a, if the system is time-reversal 
invariant. This is the reciprocity theorem [Eq. (20.16)] which was shown 
. in Sec. 20 to be valid even when V is complex. 


37 STATIONARY COLLISION THEORY 


The S matrix theory developed in the preceding section includes the 
possibility of an arbitrary variation of H with time, subject only to the 
restriction that free-particle states can be defined for finite times in the 
remote past and the far future. Although it is interesting to have such a 
comprehensive formalism available in order to show that any scattering 
problem can be formulated in principle, useful results are generally obtain- 
able only in special cases. The situation of greatest physical interest is 
that in which V is independent of the time, except for being turned on at 
one time and off at a later time. 

We first consider this stationary situation by imagining that V is 
effective for a very long but finite time tp. A particular S matrix element 
can then be related to a transition probability per unit time, in much the 
same way as in the time-dependent perturbation theory developed in Sec. 
35, although without the restriction to the case in which V is a small 
perturbation. We then go a step further and relate this transition proba- 
' bility to a scattering cross section. Following this we develop an alter- 
native procedure for the limit in which fo is infinite, so that the situation 
is strictly stationary. In this case it is not- meaningful to think in terms 
of a transition probability; rather, we shall obtain the scattering amplitude 
and cross section directly. 


TRANSITION MATRIX 


The initial and final free-particle states ¢. and ¢g, in terms of which the S 
matrix element (6|S|a) is specified, can be chosen to have well-defined 
energies. We shall suppose that the times during which V is being turned 
on and being turned off are of order Af and that, in between, V is constant 
for a time ¢t) that is much longer than At. This means that V(r,t) is 
replaced by V(r)g(t), where g(é) has the general form shown in Fig. 34. 
Then the time Fourier transform of Vg contains frequencies that lie in a 
range of order 1/At about zero. It follows from (36.36) that (6|(S — 1)|a) 
is very small unless the energy associated with y.* is in a range of order 
h/At about the energy associated with ¢s. We shall assume that At 
can be made large enough so that this energy range is experimentally 
unobservable. We shall also assume that to) can be made so large in com- 
parison with At that the uncertainty in the effective on time of V is 
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At g(t) | At 


Fig. 34 Schematic plot of the function g(t) that determines the time 
dependence of V. 


unimportant. This means that V can be on for a well-defined time while 
energy is substantially conserved during the scattering process. 
‘With this physical situation in mind, we put 
$.(f,t) = Ug (Te wat (Ho 7 Ea)Ua(¥) = 0 
vat (r,t) = xat(r)e at (H — E.)xat(t) = 0 
where E, = fw, and V(r,t) can be replaced by V(r)g(t). We then make 


use of Eq. (36.36) to define an element of the transition matrix or T 
matriz through 


(\(S — Ila) = — ¢BIT|a) [~, a(deiPot at 


(37.1) 


(37.2) 
(BT la) = f ugee)VG)xet(t) dr pe = wp — On 


TRANSITION PROBABILITY 


The definition of the scattering matrix [Eq. (36.32)] shows that |(8|S|a)|? 
is the probability that a system originally in the state a is finally found to 
be in the state 8B. So long as a and £ are different states, this is equal to 
\(8|(S — 1)|a)|?, and. we can use Eq. (37.2) to calculate the transition 
probability. If now we were to assume that At = 0 in Fig. 34, the situa- 
tion would be similar to that described in connection with Eq. (35.12) and 
Fig. 31. In order to avoid this, we shall not assume such a square form 
for g(t). 

The initial and final states in a scattering problem are continuously 
distributed, or nearly so if box normalization is used. Thus there is 
always a group of final states 6 that have nearly the same energy and for 
which (8|T|a) is roughly independent of 8. Such a group of states will 
also have nearly the same values for all the other parameters, besides the 
energy, that specify the states. For example, if the final states are 
momentum eigenfunctions, the states 6 will be characterized by a small 


314 QUANTUM MECHANICS 


range of direction of the momentum vector as well as of its magnitude. 
We write the number of these states as p(8) dEs, where dEg is the energy 
range and p(8) is evidently a differential in the other parameters. Then 
the total probability of finding the system in one or another of this group 
of final states is a summation of |(@|(S — 1)]a)|?: 


[ \Kal(S — 1)la)|29(6) ad 
sg if (B|T|o)|? F g(thetsat dt |" p(B)h dupa 


where we have replaced dEg by h dwga since wa is constant. Now the 
Fourier transform of g(t) is strongly peaked at ws. = 0, so that we can 
remove p(8)|(8|T'|a)|? from inside the integral over ws, and extend the 
limits to +o. Then 


ae i g(te*Ba‘ dt i duge = on te lg(t)|? dt 


and this is essentially equal to 2mt if g(t) has the form shown in Fig. 34. 
The transition probability per unit time is then given by 


w= f |(6|(S — 1)]a)|*(8) dB = = 9(8)|(6|T|a)|? (37.3) 
ty h 


Equation (37.3) is exact. Comparison of the second term on the 
right side of Eq. (36.35) with the corresponding term of (36.36) shows 
that a perturbation approximation to (37.3) is obtained by replacing xat 
by u. in (37.2). This replacement gives Fermi’s ‘‘Golden Rule No. 2.’’! 


SCATTERING CROSS SECTION 
An expression for the differential scattering cross section is most con- 


veniently obtained from Eq. (37.3) by using box normalization. We 
choose the ug to be the momentum eigenfunctions 


ug(r) = L-* exp (akg - r) 


in which case p(B) is given by Eq. (35.18): 


3 
p(B) = Ara kp dQ, (37.4) 
where dQ, is the infinitesimal element of solid angle associated with the 
direction of kg. 

The value of w obtained by substitution of (87.4) into (37.3) is the 
number of scatterings into dQg per unit time when there is initially one 
system in the volume L?. This is an incident flux of »./L? per unit area 


1K. Fermi, ‘“Nuclear Physics,’’ p. 142 (University of Chicago Press, Chicago, 1950); 
see also Eq. (35.14). 


% 
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and time, where v, is the relative initial speed. Since the differential 
scattering cross section is defined as the scattering per unit incident flux, 
we have 


o(Kkp,Ka) d%p = , 
so that 
o(Ikp)Ka) = ce ( a) \(8|T]a)|? (37.5) 


where we have replaced kg by yvg/h. In the case of elastic scattering now 
under consideration, energy is conserved and vg = 2. 

The wave functions ug and xt that appear in the 7’ matrix element 
(37.2) are each normalized to one system in the volume L*, so that 
(g|T|e) is proportional to 1/L’. Thus the differential cross section is 
independent of L, as of course it must be. It is also worth noting that 
the total transition probability wto, where w is given by (37.3), is propor- 
tional to fo/L?. Since the use of box normalization implies that the limit 
L— o will ultimately be taken, the apparent difficulty associated with 
the fact that the probability wt) can be greater than unity if to is made 
sufficiently large does not in fact arise. This probability actually becomes 
vanishingly small for any finite value of t) as L— ». 


GREEN’S FUNCTIONS FOR STATIONARY CASE 


We now develop an alternative procedure that is applicable in the limit in 
which fp is infinite, so that the situation is strictly stationary and energy is 
strictly conserved. It is necessary to assume that the S-matrix formalism 
developed in Sec. 36 remains valid in this limit; this can be shown along 
the lines of the wave-packet approach referred to in the footnote near the 
end of that section. We cannot now think in terms of a transition proba- 
bility per unit time but instead obtain the scattering amplitude and cross 
section directly. This strictly stationary situation is most easily and 
most directly approached by starting with the time-independent 
Schrédinger equation. Although we shall follow this procedure a little 
later, we first devote this and the next subsection to a treatment that 
starts from the time-dependent Green’s functions already discussed in 
Sec. 36. 

An explicit expression for the Green’s function G(1’,t’;r,t), defined in 
Eq. (36.2), is easily obtained from Eq. (10.19) in the stationary case. In 
our present notation it is . 


G(r',tsr,t) = —Saxalr’) x2 (r)e alt’) (37.6) 


where the x. constitute a complete orthonormal set of solutions of the 
time-independent Schrédinger equation (H — E.)x2 = 0. The fact that 
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G is a function only of ¢’ — t, and not of ¢’ and ¢ separately, is a conse- 
quence of the invariance of H with respect to time displacement in the 
stationary case. In accordance with (36.3), the propagator is obtained 
by multiplying this G by 0(f’ — t). Since the form (37.6) for G resembles 
a Fourier expansion in the time difference t’ — ¢, the possibility of separat- 
ing out the time dependence by expressing 6(t’ — t) as a Fourier expansion 
suggests itself. We now show that this has the form 


‘ 1 o_o, dw 
@(r) = lim — 5 i, € (37.7) 


e>0t —o (4) + 1€ 


We note first that the derivative of this with respect to 7 is 


: 2 eT dy 


Qn J—« 


which in accordance with Eq. (11.10) is equal to 6(r), as expected from 
Eq. (36.23). This result is independent of whether the denominator of 
the integrand in (37.7) is chosen to be w + de or w — ie, that is, whether 
the pole in the integrand is below or above the contour of integration, 
which is along the real axis. However, the two choices give different 
values for the integral. The first choice corresponds to a pole at ~—7e as 
shown in Fig. 35a. For 7 < 0, this contour may be completed with an 
infinite semicircle in the positive imaginary half plane, as in Fig. 35b, 
since the exponential becomes vanishingly small there and contributes 
nothing to the integral; then 6(r) = 0. For7 > 0, the contour may be 
completed as in Fig. 35c, and the integral is equal to —2z27 times the 
residue of the integrand at the only pole (w = —7e) that lies within the 
contour. Then, in the limit e— 0+, 6(r) = 1. It is easily verified that 
the second choice for the denominator, w — ze, leads to 6(r) — 1 instead 
of to Eq. (36.4). 

We now combine Eqs. (37.6) and (37.7) to obtain an expression for 
Gt(r’,t’;r,t). It is convenient to change the variable of integration from 
w to w + we and to write Gt in the form of a Fourier analysis: 


1 fe 
Gt r’,t! rt)= > Gt rr —tw(t'—t) g 
eee = (37.8) 


Gott’ r) = Salw — wa + te) !x0(r’) x2 (r) 


where the limit « — 0+ is always implied. It is easily established that the 
advanced Green’s functions G-(r’,t’;r,t) and G,~(r’,r) are related to each 
other by the first of Eqs. (37.8) and that G,~ is given by the second equa- 
tion with +e replaced by —e in the denominator of the summand. In 
analogy with Eq. (36.24), it can also be shown from the completeness and 
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Fig. 35 Contours for the integration of Eq. (37.7). 
orthonormality of the x. that G+ satisfies the equation 
(« — 5’) G.*(r’,r) = (r’ — 4) (37.9) 


where H’ is H expressed in terms of r’. 


GREEN’S FUNCTIONS AS INVERSE OPERATORS 
We are now in a position to see how the inverse operators (36.30) can be 
written explicitly. On restoring the primes dropped just before (36.24), 
the first of Eqs. (36.30) becomes 
a . re) 1 , Re / / 
Gt(r’,t’ 7, = ta 5H 53(r’ — r)d(t’ — 2) 


_(,2 _1l1pV' q ‘ 1 fe wero 
-(i3 7H) Sexa(t’)x2(t) = ["_ ¢ dw 


318 QUANTUM MECHANICS 


Here, the completeness relation of the x. has been used to substitute for 
63(r’ — r), and Eq. (11.10) has been used to substitute for 6(¢’ — t). A 
typical term of the summand-integrand, x.(r’)e~*”, is an eigenfunction of 
the operator 7(0/dt’) — (1/h)H’ with eigenvalue w — wa. We therefore 
try to replace any function of this operator, when operating on this typical 
term, by the same function of the eigenvalue. If this is done, we obtain 


ke | 
Gt(r',t’ i,t) _ =~ | Sa(w = Wa) Xa (t’) xx (r) eet’) dw 
Zr? 


This provides an example of the singularity mentioned below Eq. 
(36.30). The integral over w is not well defined unless the pole at we is 
removed from the real axis. This may be accomplished by replacing 
® — Wy by w — We + ze in the denominator of the integrand, and this 
leads to the correct G+ as obtained in Eqs. (37.8). The insertion of +7 
both removes the singularity and specifies whether the Green’s function 
propagates forward or backward in time. 


STATIONARY PROPAGATOR 


The stationary Green’s functions G,,+(r’,r) defined in (37.8), which are the 
Fourier transforms of the time-dependent Green’s functions G+(r’,t’ ;r,¢), 
can also be obtained directly from the time-independent Schrédinger 
equation. For a stationary scattering problem, we are interested in a 
solution of this equation that has the asymptotic form of a plane plus 
outgoing scattered wave, as in Eq. (18.10). We expect this to be associ- 
ated with the propagator G+ rather than with G,~, since ~at(t,t) was 
defined in Eq. (36.31) to be the wave function that grows out of the free- 
particle wave function ¢,(r,t) starting from a time in the remote past 
before V was effective. In accordance with (37.1), we call this stationary 
solution xa+(r) and write it as 


Xat(r) = Clexp (ck. -r) + v(r)| = walt) + Co(r) — (37.10) 


where u,(r) is the suitably normalized plane wave defined in (37.1). The 
coefficient C is equal to L~? for box normalization and to (27)? for con- 
tinuum or 6-function normalization (see Sec. 11). Since the scattered 
wave associated with x.* is outgoing, the asymptotic form of v(r) may be 
written, in analogy with (18.10), as 


v(r) aaa * f(UeJa)eibo (37.11) 


where the vector k, that appears in the scattering amplitude f has magni- 
tude k, and the direction of r. It is important to note that the coefficient 
C is the same for x,*+ and for wa, since v(r) contributes a negligible amount 
to the normalization integral. Further, this procedure is valid as well for 
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complex V, since stationary scattering solutions (EZ, real) can always be 
found. 
Substitution of (37.10) into the wave equation 


[Ho + V(r) — Ealxat(r) = 0 
shows that u(r) satisfies the inhomogeneous differential equation 
1 
(Ho — Ea)u(t) = — G V@)xat() 
This equation is easily solved with the help of the free-particle version of 
Eq. (87.9), to give 
v(x’) = (AC) GGT) V ©) xa* (x) d?r (37.12) 


Combination of (37.10) and (37.12) shows that xa* satisfies an inhomog- 
eneous equation, called the Lippmann-Schwinger equation’: 


Xat(r’) = ua(t’) + ASG, (1) V(r)xat(r) dr (37.13) 


By making use of the free-particle version of Eq. (37.9), we may write 
this symbolicaHy as 


Xat = Ua + (Eu — Ho + te) Vi xa (37.14) 


Since Ho:is hermitian, its eigenvalues are real, and the inverse operator is 
nonsingular so long as e is finite. 
Equation (37.13) can be iterated to give an infititte series for xa?: 


xat(t’) = ual’) + AGE, (t 2) V(r) ua(t) dr 
+ A-*f [GZ (r' 0s) V(r) Gh (tut) V(r) ua(r) dr, d'r + +++ (37.15) 
Also, Eq. (37.14) can be manipulated symbolically to give 
Xat = ta + (Ha — H+ te) “Vue (37.16) 
which can be written out in the form 
Xa*(t’) = ust’) + AGS) V(r) ua(r) dr (37.17) 
Equation (37.17) can also be derived without using symbolic relations. 


FREE-PARTICLE PROPAGATOR 


In order to make use of equations like (37.13), we need an explicit expres- 
sion for the free-particle propagator Gp, (r’,r). For positive w, the second 


1B. A. Lippmann and J. Schwinger, Phys. Rev. 79, 469 (1950). 
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of Eqs. (37.8) becomes 
Gout (r’,r) = Saw — Wa + te)~ Lus(e ute) 
= / (k? — ka? + ie)? exp [ilta(r’ —1)] d%kq (87.18) 


where k? = 2uw/h. Because of the space-displacement invariance of Ho, 
Gowt is a function only of r’ — rand not of r’ andr separately. Integra- 
tion over the angles of k, with r’ — r as polar axis gives 


Goot(r’,r) = ” (k? — Ka? + t€)— he sin kap aka 
0 . 


ML 
whip 
where p = |r’ —r|. The k, integral is most readily evaluated by extend- 
ing the lower limit to — ©, writing sin kap in exponential form, and using 
residues as in the evaluation of (37.7). The result is 


Goot(t',r) = — 5 [r’ — r[- exp (ik|r’ — rl) w >0 
4 
pack 72) (37.19) 


For negative w, the only change is in the sign of 7 in the exponent, where 
now k = +(2u|w|/n)!. Thus, for both positive and negative w, substi- 
tution into the first of Eqs. (37.8) shows that Got is a superposition of 


waves that travel radially outward from r to r’, in agreement with Eq. 
(36.10). 


SCATTERING AMPLITUDE 


The scattering amplitude f(k,,k.) can now be found by comparing the 
asymptotic form of (37.12) with (37.11). We note that r is essentially 
restricted to finite values since V(r) appears in the integrand of (37. 12). 
Thus in the exponent of G{,, we have 


wa 


2 
lr’ —r| = r’ —rcos 6’ +O = 
r 


and in the denominator 


i — att =r +0(5) 


where 6’ is the angle between r’ andr. Equation (37.19) then gives 


, _ _# tke (r’—~r cos 0”) 
GSaoa(t’,T) Tae. Dake 


and substitution into (37.12) gives the asymptotic behavior 


V(t") <> — gapecp ca f etareee Ve) xet(E) dr (37.20) 


APPROXIMATION METHODS IN COLLISION THEORY 321 


We now define a vector Ky which, like the vector k, in Eq. (37.11), 
has magnitude k, and the direction of r’, so that kar cos 6’ = kg-r. Com- 
parison of (37.11) and (37.20) then gives 


f(ep)ka) = — 5-Fag f exp (ake 1) V@)xet (a) ar 
= — apace | FOV OxetG) ar 


= — aeqqop (IT la) (37.21) 


where the 7’ matrix element is defined in Eq. (37.2). Since, in accordance 
with (37.10), ug and x.* both have asymptotic magnitude |C|, the scatter- 
ing amplitude is independent of the choice of C, as of course it must be. 

As shown in Eq. (18.11), the differential scattering cross section 
from the initial propagation vector k, to the final propagation vector kg 
is equal to |f(ks,ka)|?.. Thus (37.21) is in agreement with (37.5) when box 
normalization is used. 


INGOING WAVES 
It is easily seen that Eqs. (37.13) to (37.19) can be rewritten correctly by 
making the following substitutions: 

Xat > Xam Ghat’ st) > Gouat’ st) tt (37.22) 
In analogy with (37.11), xa~ has the asymptotic form of a plane plus 
ingoing wave: 


e700) oC exp (ike #) + FF Uinkalete| (37.23) 


Comparison of the original and rewritten equations then shows that if 
V is real 


Xat (r) = xx (r) f(K,,Ka) = f- i (k,, —k,) (37.24) 


where x_, is the time-reversed state to x.*+ and has the opposite momen- 
tum —k,. 

Still considering the case in which V is real, we can rewrite the T 
matrix element in terms of xg~. From Eqs. (37.2) and (37.16) we have 


(B|T a) = (up,Vxat) = (up,Vua) + (us, VE — H + ie) Vue] 
= (ug, Vue) + [((E — H — te)—'Vug, Vue] (37.25) 
where FE = E, = Es. The rewritten version of (37.16) is 


Xe” = ug + (E — A — ie) Vug 
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so that (87.25) becomes 
(B|T|a) = (xs7, Vue) V real (37.26) 


It is sometimes useful to have available an expression similar to 
(37.26) when V is complex. We accordingly define wave functions 
Xa7+, which are stationary solutions of the Schrédinger equation with V 
replaced by V*: 

Xa? + = Ug + (BH — Ao + te)-“'V* xe 

= Ua + (KE — At + te)'V* un 
X-a”~ is the time-reversed state to x.+ but does not represent a physically 


possible behavior of the system since H is not time-reversal invariant. 
Equation (37.26) then becomes 


(8|T la) = (xs7~,Vua) V complex (37.28) 


(37.27) 


S MATRIX FOR STATIONARY CASE 


An explicit expression for the S matrix element can be obtained by allow- 
ing the cutoffs on g(t) in Eq. (37.2) to extend to + : 


Omi 
(8|(S — Dla) = — $* (ws — wa)(8|T la) 
e 
; hike 
where use has been made of the fifth of Eqs. (11.13). We also require an 


expression for (8|a), which with continuum aed is 


(Bla) = 0°(dsy — Hey) = Se — Be) 268 —%0) 504, — 9.) (87.30) 


where the weight functions appropriate for spherical coordinates have 
been included. We can now combine Eqs. (37.21), (37.29), and (37.30) 
with C = (27)-}, to obtain the S matrix element: 


(B|S|«) = 2p te ke) te | Mee Oa 9a) 5( dp _ Pa) a foe) | 
(37.31) 


Equation (37.31), together with the assumption that S is unitary, 
can be used to derive the optical theorem and the generalized optical 
theorem, discussed in Sec. 20. 


5(kg — ka){B|T|a) (37.29) 


ANGULAR MOMENTUM REPRESENTATION 


The S matrix element (37.31) is expressed in terms of the polar coordi- 
nates of thé initial and final propagation vectors: 


(B|Sla) = (kpOs¢p|S|kababa) 
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It can be transformed to the representation specified by the energy (or 
the magnitude of the propagation vector) and the angular momentum by 
making use of the unitary transformation matrix (27.29): 


(Og|lm) = Yim(6,¢) | (37.32) 
We then have 


(kelamg|S | KkalaMa) 
= So, 0%ada(larra| Os bp)(kp0pbs|S|kababa){ Oa balan) (37.33) 


The generalized summation symbols are angle integrations in this case 
and include the weight functions sin 0, sin 6., as remarked after (27.29). 

Equation (37.33) takes a particularly simple form if V is spherically 
symmetric, since then, in accordance with Eq. (36.40), S commutes with 
the angular momentum operater. The scattering amplitude depends 
only on the angle 6 between k, and kg and may be written as in Eq. 
(19.11): 


f(kts,fee) = —_ Y (21 + 1) @%* — 1)Pi(cos 6) (37.34) 
ort ; 


Here cos 6 = cos 63 cos 04 + sin 6 Sin 9. cos (ds — $a), and, in accordance 
with Eqs. (32.10) and (14.16), 


l 
Pi(cos 8) = sy De Vim (6p, 68) Yi, (Basa) (37.35) 
Substitution of Eqs. (37.31), (37.32), (37.34), and (37.35) into (37.33) 
gives 


(kelpmg|S|kalaMa) 
= Mee fff sin 63 dg ddp sin Oa d0a dba Vigmg (96, 8) 


bee a) 5(¢8 oe a) + 2 y ( (e?% cad 1) Yim(9p,¢8) Vin (0ass) 


sin 6, 
5(kg — ka) 
k,? 


m=—l 


Vigma(Oa;Pa) = €?%ta Sista Smgma (37.36) 
where use has been made of the orthonormality of the spherical harmonics. 

Thus S is diagonal in the representation specified by the wave 
number & (or the energy) and the angular momentum quantum numbers 
land m. This is to be expected since a static spherically symmetric 
potential does not change either the energy or the angular momentum 
during the scattering process. This representation of the S matrix is” 
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easily verified to be unitary provided that the phase shifts are real, as 
is the case when H is hermitian or V is real. It should also be noted, 
from the discussion of Sec. 19, that the radially incoming wave r—e—‘#r 
is not modified in the partial wave analysis, whereas the outgoing wave 
for a particular / is multiplied by e?*. Thus each diagonal element of 
the scattering matrix has a natural interpretation as the modification of 
a particular outgoing or scattered partial wave as compared with the 
corresponding incoming or incident partial wave. ; 


387APPROXIMATE CALCULATIONS 


In the preceding section, an exact relation between the scattering ampli- 
tude and the 7 matrix element [Eq. (37.21)] was derived. The 7’ matrix 
element in turn is expressed in terms of the stationary outgoing solution 
Xa*(r) through Eq. (37.2), and an infinite perturbation series (37.15) is 
available for the calculation of x2+. The first order of this perturbation 
series leads to the Born approximation,! which consists in replacing xat 
by uz. We show first how this method can be applied in some interesting 
cases of potential scattering and then how it can be generalized to treat 
scattering by a system with internal degrees of freedom (electron-atom 
scattering). Finally, we discuss the eikonal approximation, which is 
closely related to the WKB approximation. 


BORN APPROXIMATION 


The procedure outlined in the preceding paragraph leads at once to the 
Born expression for the scattering amplitude: 


fa(Kp,Ka) = — aancp ug (r)V (r)ua(r) d3r 


— 4 [ VO) exp (iq-1) a q =k, — Kg 
(38.1) 


Thus the Born scattering amplitude is proportional to the spatial Fourier 
transform of the scattering potential with respect to q, where fq is the 
momentum transfer from the incident particle to the scattering potential 
during the collision. It is reminiscent of the time Fourier transform that 
appears in the perturbation expression for the transition amplitude [Eq. 
(35.9)]. If the potential V is spherically symmetric, Eq. (38.1) may be 
simplified by integrating over the angles of r with respect to q as polar 


1M. Born, Z. Phystk 38, 803 (1926). This method appears to have been first used by 
Lord Rayleigh, Phil. Mag. 12, 81 (1881); see also R. Gans, Ann. Physik 76, 29 (1925). 
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axis. In this case, fe depends only on the angle @ between k, and Kg: 
fa(0) = — 3 ai r sin gr V(r) dr q = 2k sin 10 (38.2) 


where k is the magnitude of k, and kz. 

An interesting example of the application of Eq. (38.2) is to the 
elastic scattering of an electron by a neutral atom that is represented by 
a simple form of screened coulomb potential: V(r) = —(Ze?/r)e77!*. 
This behaves like the nuclear coulomb potential for atomic number Z 
when 7 is small and falls off rapidly when r is large in comparison with 
the “radius” a of the atomic electron cloud that screens the nucleus. 
The Thomas-Fermi statistical theory of the atom (see Sec. 47) shows that, 
for moderately heavy atoms, a is roughly equal to h?/me?Z!, where m is 
the electron mass. Substitution of this potential into (38.2) gives 

2uZe? 
The cross section |fs(6)|? is in agreement with the Rutherford result 
(21.11) when the momentum transfer is large enough so that 1/a? can be 
neglected in comparison with g? in the denominator. In the analogous 
classical situation, the incident electron passes close enough to the nucleus 
so that the atomic electrons are relatively ineffective in screening the 
nuclear potential. 

The total cross section is most easily evaluated by changing the 
variable of integration from @ to g = 2k sin $6, in which case sin 6 dé is 
replaced by qg dg/k?: 


x [5 lie)! sin a6 = FF [* |fo(a) "a da 


_ 161y?Zeta4 
 K*(4k2a? a 1) 


fa(0) = ante = f ecgne er = (38.3) 


With the above Thomas-Fermi expression for a, and neglecting the differ- 
ence between m and yu, the total cross section becomes 41Z'/k? at high 
energies (ka >> 1). This agrees in order of magnitude with the result of 
a numerical integration! of the scattering produced by the Thomas- Fermi 
potential. 


.VALIDITY OF THE BORN APPROXIMATION 


Since the only approximation involved in the preceding subsection is the 
replacement of x_* by ua in the T matrix element, a convenient criterion 
for the validity of the Born approximation can be obtained by requiring 
that |v(r)| in Eq. (37.10) be small in comparison with |exp (ik. + r)| = 


1E. C. Bullard and H.S. W. Massey, Proc. Cambridge Phil. Soc. 26, 556 (1930). 
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We must use perturbation theory to estimate v(r), by making use of 
(37.12) and replacing x.+ by ua. in the integrand. It is then most con- 
venient to make this estimate at r = 0, and this should be satisfactory 
since it is likely that v(r) is largest at the center of the scattering potential. 
The resulting validity criterion is probably sufficient but may be more 
stringent than is actually required; for example, the small-angle scattering 
(small momentum transfer) may be given correctly by the Born approx- 
imation when the large-angle scattering is not. 
We obtain in this way 


v(0) 


R 


(hC)— / Gooat (0,1) V(r) ue(t) dr 


= Fi ih Te eV (rye r dr dw 
= — i [7 ee -— DVO ar (38.4) 


where w is the cosine of the angle between k, and r. With the screened 
coulomb potential substituted into (38.4), the condition |v(0)|<«<1 
becomes 


2uZe* 


us — 


I ” sin x eit—z/ka a 
0 x 


where x = kr replaces r as the variable of integration. For ka <1, 
this becomes 2uZe?a/h? < 1 which, with the earlier approximate expres- 
sion for a, is equivalent to Z! < 1; thus the Born approximation cannot 
be used for the scattering of slow electrons by atoms. For ka> 1, the 
criterion becomes (Ze?/hv) In ka < 1, and the approximation is useful for 
fast electrons so long as Z is not too large. It turns out that this last 
result is substantially unaffected by relativity theory; thus the Born 
approximation becomes poor for the heavier elements since v cannot 
exceed c, and e?/hc = y4y. 


SCATTERING FROM TWO POTENTIALS 


It is sometimes convenient to write the potential as the sum of two parts, 
V = Vi + V2, and then to express the JT matrix element as the sum of a 
part that would arise from V, alone and a correction term.! Such a 
procedure is useful if the wave equation for V, by itself is exactly soluble 
and if V2 is small enough so that it can be regarded as a perturbation. 
For example, Vi may be the coulomb interaction between an electron and 
a point nucleus of large Z, and V2 the additional electrostatic interaction 


1K. M. Watson, Phys. Rev. 88, 1163 (1952); M. Gell-Mann and M. L. Goldberger, 
Phys. Rev. 91, 398 (1953). 
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that arises because the nucleus is of finite size. As another example, V1 
may be the strong, short-range nuclear interaction between two protons, 
for which only a small number of partial waves need be calculated, and 
V. the much weaker coulomb interaction. 

We note first that Eqs. (37.2), (37.28), (37.14), and (37.27) are, 
respectively, — 


(B|T la) = (up,Vxat) = (xe7~,V ua) 
Xa = Ua t (E — Ho + U)OVXar (38.5) 
xe" = Ug + (E — Hy -— te) 1 V * xg? 


where E = E, = Ez. We may then substitute for us in the first of the 
expressions (38.5) for (8|T a) by using the equation 
x1e7 = Up + (FE — Ho — te) VE x187- 
which describes the scattering from Vi alone. The result is 
(B\T |x) = [up, (Vi + V2)xat] 
= [xe7, (Vi + V2)xat] 
— [(E — Ho — te) V8 xag™, (Vi + Vide! 
= [x1e?-, (Vi + V2) xa] 
— [xip?-, Vi(E — Ho — te)“*(Vi + V2) xa*] 
We now note that (E — Ho + te)“"(Vi + Vo)xat is equal to xat — Ua; 


and that (xis7-,Viva) is equal to (8|Ti\a), the T matrix element that 
describes scattering by Vi alone. We thus obtain 


(B|T\a) = (B|Tila) + (x167~,V 2xa*) (38.6) 


which is exact. 


DISTORTED WAVE BORN APPROXIMATION 


Equation (38.6) is most useful when V2 is small, in which case the second 
term is given to first order in V2 by replacing xa2* by x1at. This gives the 
T matrix element in the distorted wave Born approximation (DWBA):! 


(B|T a) ~ (B|Tila) + (x187~, V2x1e*) (38.7) 


The form of Eq. (38.7) is reasonable insofar as it gives the correction 
term, (8|(7' — 7T')|a), as a matrix element of the second potential between 
states that are distorted by the first potential. However, it may at first 
seem strange that this matrix element is to be calculated between an 
initial state that is asymptotically a plane plus an outgoing spherical 


1N. F. Mott and H.S. W. Massey, ‘“‘The Theory of Atomic Collisions,” p. 100 (Oxford. 
New York, 1933). 
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wave and a final state that is asymptotically a plane plus an ingoing 
spherical wave, rather than between two states of the outgoing type. 
This result can be made physically plausible in the following way.' 

The calculation is intended to yield the probability amplitude for a 
transition in which V2 causes scattering into the direction kg. However, 
if the final-state wave function were to be of the outgoing type, one would 
expect that a portion of this amplitude would be associated with scattering 
into directions other than Kg, since all directions are included in the out- 
going spherical wave. In the same way, a portion of the probability 
amplitude for scattering in the direction kg would then be included in the 
calculations for other final-state directions, since their outgoing waves 
would include contributions to scattering in the direction ks. The only 
way to avoid this situation is to choose the final-state wave function 
in such a way that it contains no outgoing spherical wave; this is possible 
only if the wave function is asymptotically a plane plus an ingoing 
spherical wave. 


PARTIAL WAVE ANALYSIS OF THE DWBA 


If V; and V2 are spherically symmetric, the distorted wave Born approx- 
imation can be expressed in terms of the scattering phase shifts 8; that 
are associated with V; alone. A simple extension of the work of Secs. 
19 and 37 shows that we can express x1.+(r) and Xis7~(r) as follows: 


Xwt(r) = C > (21 + 1)#Rj(r)P;(cos 6x) 
l=0 
sae Cex ee 3) + Fh) ] 
x7-(t) = C » (21 + 1)#R*(r) Pi(cos 65) 
lx 


a C [exp (tkg:r) + 1 FC, oa (38.8) 


f(KKe) = (2ik)— » (21 + 1)(e?* — 1)P,(cos On) 
t=O 
f?-(ie,jks) = (21k)! Y (2 + 1)(—1)' — e-**)Pi(cos 64) 
t=0 
on ihe =~ He a) 
z jp Sin (kr — ble + 1 
1 A. Sommerfeld, ‘“‘Wellenmechanik”’ (Ungar, New York) or “‘Atombau und Spek- 
trallinien,” vol. 2, footnote p. 457 (F. Vieweg & Sohn, Brunswick, Germany, 1939); 
G. Breit and H. A. Bethe, Phys. Rev. 98, 888 (1954). 


Ri(r) —> 
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Here, k = ka = kp, 0. is the angle between r and k,, and 4% is the angle 

between r and ks. Note that, for any angle 0, f(0) = f?-*( — 6), a rela- 

tion that agrees with the second of Eqs. (37.24) when V is real. 
Substitution from Eqs. (38.8) into the second term of (38.7) gives 


(x9?-,Vexsat) = 4n|Cl? Y (21+ 1)Pi(cos 8) f° Valr)Re(r)r? dr 
t=0 . 
(38.9) 


where 6 is the angle between ks and k,, and use has been made of the 
addition theorem for the Legendre polynomials, Eq. (32.10). Equation 
(38.9) is not very convenient for computational purposes, since it is an 
infinite series over 1 even when Vj is of short range so that only a few of 
the 6, are different from zero. This suggests that we subtract V2(r)7:?(kr) 
from each integrand and add in a compensating series. Since R,(r) is 
equal to ji(kr) if 6: = 0, this means that JV2(r)[Ki2(r) — ji(kr)\r? dr is 
zero unless 6; is different from zero. Further, the compensating series 
that has been added in can be summed by making use of the formula! 


Yl + D4t(kr) Pu(cos 6) = ae 
l=0 


q = 2k sin 36 (38.10) 


and this leads to the Born approximation amplitude for V2 alone. 

- Combination of Eqs. (37.21), (38.2), (38.7), (38.9), and (38.10) 
leads to the distorted wave Born approximation amplitude for spherically 
symmetric scattering :? 


fowsa(@) = fi(0) + fos(O) 
— 4 Y (al + 1)Pi(cos 6) f° Va(r)LRi%(r) — je2(er)|r* dr (88.11) 
t=O 


Here, fi is the exact scattering amplitude for Vi alone and fez is the 
Born approximation amplitude for Vz alone. The series represents the 
effect of the plane-wave distortion produced by V: on the first-order 
scattering caused by V2; the only J values that appear in this series are 
those for which 5; ~ 0. It should be noted that the integrals in (38.9) 
and (38.11) converge only if V2 falls off more rapidly than 1/r for large 
r, so that special care must be taken if V2 is an unscreened coulomb 
potential. 


1G. N. Watson, ‘Theory of Bessel Functions,’ 2d ed., p. 366 (Macmillan, New York, 
1945). 


2L. I. Schiff, Progr. Theoret. Phys., Suppl. Extra Number (1965) 400. 
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APPROXIMATE EXPRESSION FOR THE PHASE SHIFTS 


We now return to the case in which the potential is not divided into two 
parts. If V is spherically symmetric, we may substitute the expansions. 


xet(r) = C 2, (21 + 1)ie*R,(r)Pi(cos 62) 


us(r) = C > (21 + 1)ju(kr)Pi(cos 6) 


into (37.21), where 6, is the angle between r and k,, and 6g is the angle 
between r and Ks. We thus obtain the following as an exact expression 
for the scattering amplitude: 


fe) = — F >) (21 + 1)Pi(cos 6) i Vir)Rinjukr)r? dr (38.12) 
=0 
Comparison of Eqs. (19.11) and (38.12) shows that the phase shifts are 
given by 
ae Quk fe : 

e* sin 6) = — cs f V(r) Rilr)jilkr)r? dr (38.13) 
An approximate expression for 6), valid to first order in V, can be obtained 
by replacing the whole left side of (38.13) by 6, and R.(r) by ji(kr) in the 
integrand on the right side. This is the Born approximation for the 
phase shift: 

2pk 


ba = — Ef” V(r)it(der)r* dr (38.14) 


It may happen that the first few phase shifts are large enough so 
that (38.14) is not a good approximation and the remaining 6; are small 
but not negligible. In this case Eq. (19.11) can be written 


10) = fo(0) + a7 Y Ql + 1)(e% — 1 — 248n)Pi(cos 8) (38.15) 
1=0 


where use has been made of Eqs. (38.2), (38.10), and (38.14). Each 
term in the summation on the right side of (38.15) is zero through first 
order in V or 6), so that only the first few terms fail to vanish in the 
present case. 


SCATTERER WITH INTERNAL DEGREES OF FREEDOM 


We shall now extend the T matrix formalism to the case in which the 
scatterer has internal degrees of freedom, so that it can be excited during 
the scattering process. For example, an electron may be scattered by 
an atom that is initially in its ground state; after the collision, the atom 
may be left in its ground state (elastic scattering) or in an excited state 
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(inelastic scattering). However, in either case, the total energy of the 
electron and atom is conserved so that we are dealing with a stationary 
situation. We leave out of consideration the possibility that the incident 
electron changes places with the atomic electron; such exchange collisions 
will be discussed in Chap. 10. 

We divide the total hamiltonian into a part Ho that describes the 
internal motion of the scatterer together with the kinetic energy of the 
relative motion of incident particle and scatterer, and a part H’ that 
represents the interaction between the two. For example, in the scatter- 
ing of an electron (coordinate ri) by a hydrogen atom (atomic-electron 
coordinate rz), we have 


H =Hy)+H' 
sp a EY a Baie eee tee 
Res 5 a H!=—-~— (38.16) 


The distinction between the reduced mass yu of the electron-atom system 
and the electron mass m is unimportant here but could be significant 
in a nuclear collision. 

The eigenfunctions of Ho are specified by two parameters: 


H Waa = Eyatea 


2f- 2 
Uaa(t1,F2) at exp (ke ° r) Wa(T2) Fae = Towa Py a + €a 


(38.17) 


where w, and e, are a typical eigenfunction and eigenvalue of the hamil- 
tonian of the scatterer. The corresponding eigenfunction of H that 
corresponds to outgoing waves has the asymptotic form 


Xaat(¥1,T2) Wee. Cc [ exp (tke : 11) Wa(T2) 
+ Yi ry te rif(K,,b jKa,a)ws(t2) | (38.18) 
b 


Here k, is a vector whose direction is that of r1 and whose magnitude is 
given by the energy conservation relation (A?k,?/2u) + @ = Has. The 
amplitude f with 6 = a describes elastic scattering, and those with b ¥ a 
describe inelastic scattering. 


ELASTIC AND INELASTIC CROSS SECTIONS 
The wave function x,t satisfies a Lippmann-Schwinger equation which 
is an obvious generalization of (37.13): 
Xeot (FiF2) = UealF ite 
+ AOS SQ Gcaa(F (r450o501,%2)H’ (11,82) Xaat(F1,%2) d'ri d’r, (38.19) 
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The propagator associated with H» may be obtained by generalizing the 
second of Eqs. (37.8) and making use of the analysis that leads to (37.19): 


Ghowe(T 10250102) = S6Se(waa — war + te) ge (E482) Ups (L182) 


= Sete (12) 05 (r2) rar / (ks? — kg? + te)—} exp [ik - (tr) — 11)] d?kp 
est a Sowe(r2) we (ro) |r; — r\-? exp (ikelr; —11|) (38.20) 
; 
k= + & + ze (€4 — «)| 


where ky has been obtained from energy conservation. 

It is now a simple matter to extend the derivation of Eq. (37.21) 
to the present situation. Substitution of (38.20) into (38.19) and com- 
parison of its asymptotic form with (38.18) lead to 


f(k,,b jka,@) = TG ff ether conb'ap* (to) H' (11,82) Xeat (£1,720) d*r; d®re 


where @’ is the angle between k, or r, and r;. We now define a vector 
ks, which has magnitude k, and the direction of rj, to give 


H(kte,bikkaa) = — sarc75 (8017 laa) (38.21) 


(8b|T |«a) = ff Uge (¥1,¥2) A’ (11,82) Xaat (¥1,72) d®r; dre P 


Equations (38.21) are an obvious generalization of (37.21); further 
generalization to scatterers with several internal coordinates can easily 
be made. 

The elastic differential cross section is obtained from the asymptotic 
form of xaa* as in Sec. 18 and is equal to the square of the magnitude of 
the amplitude f for which b = a: 


o(Kke,a;k,a) = |f(Ke,a;k.,a)|? (38.22) 


In calculating the inelastic cross section, however, account must be taken 
of the fact that the incident flux is proportioral to the relative initial 
speed vq = hka/u while the scattered flux is proportional to v3. We thus 
obtain the same multiplying factor vg/v, that appeared in the derivation 
of Eq. (37.5). Thus the inelastic differential cross section is 


o(kj,b;k,,a) = Te Lf Oca ia) |? (38.23) 
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" ELECTRON SCATTERING FROM HYDROGEN 


- Equations (38.22) and (38.23) provide an exact description of electron- 

atom scattering, except for the neglect of exchange between the incident 

~ and atomic electrons, which is discussed in Sec. 43. As before, the Born 

approximation is obtained by replacing xaat in the T matrix element 

(38.21) by wea. The resulting integrals are not difficult to evaluate in 
the case of hydrogen. 

. Equations (38.16) and (38.17) give for the Born matrix element 


ff eGeCas22)H (e15t2)eo(Easfs) drs d?rs 


ry 


= |C|?e? ff exp (7q ° 11) (+ — =) wi (re) Wa(Fe) d3ri3 rz (38.24) 
‘12 


-It is convenient to evaluate the 71 integration first and, moreover, to 
integrate the 1/ri. and 1/r: parts separately. However, it must be 
recognized that these separate parts are not strictly convergent, since 
for large 71 (that is, for r1 >> 2), the behavior of the first integrand is like 
that of the second, which gives 


f exp (iq : Ti) ar, = 2 i sin qri dr; (38.25) 


Tr 


Because of the similar behavior for large ri, the difference between the 
two parts falls off like 1/r1? rather than like 1/71, so that, when both are 
taken together, there is in effect an extra power of 7; in the denominator 
of the integrand of (38.25), and the integral converges. The convenience 
of integrating the two parts separately can be retained by introducing 
_an integrating factor e~*" into the integrand of (38.24) and subsequently 
taking the limit a — 0: 


4 


fq: . 4r se , 
i exp (¢q 1) dr, = lim = i. sin qrie—*" dry = Z (38.26) 


Ti a- z 


With this preamble, we now write the 1/ri2 part of the r: integration 
in- (38.24) as . 


f =e (iq : 11) dr, = exp (iq F I) / exp a 7 o) d®p 


T12 
4 ‘ 
= ma exp (7q « fo) 
where we have put 9 = ri — rz and made use of (38.26). The Born 
matrix element (38.24) then becomes 


202 
aire / [exp (tq+ 12) — l]wp(r2)wa(Te) dre (38.27) 
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The w’s are the hydrogen-atom wave functions given in Eqs. (16.24) and 
have a simple enough structure so that the r. integration is elementary. 
For elastic scattering from the ground state the Born amplitude is 


b 4 2ue2 
fal) = — Fre + Agta’)? — 1) eis 
gq = 2k, sin 46 ke = ka 


Comparison of Eqs. (38.3) and (38.28) shows that the screened coulomb 
potential used at the beginning of this section does not provide a very 
good model for hydrogen, although it is better for heavier atoms. 

Similar results may be obtained for inelastic scattering. For the 
transition from the 18 ground state (n = 1, 1 = m = 0) to the 28 first 
excited state (n = 2, 1 = m = 0), the second term in the integrand of 
(38.27) is zero because of the orthogonality of the initial and final atomic 
wave functions, and the amplitude is 


8 VV 2uao2e? 
fa(@) = — 3 ¥/2nastet 


gq? =k? + kp? — 2kakg cos 0 kg? = k,? ~— —; 


(q?ao? + 8)-* 
(38.29) 


In the high-energy case, for which the Born approximation is best, kaao 
is large compared with unity, kg is nearly equal to ka, and q is approx- 
imately equal to 2k, sin 36. Then (38.29) shows that most of the scat- 
tering occurs for ga) < 1, which is equivalent to 6 < 1/k,zao. Beyond 
this, the differential cross section falls off with increasing angle approx- 
imately as cosec!? 36. This is a much more rapid decrease with angle 
than the cosec! $6 dependence obtained for elastic scattering from (38.28). 
The difference occurs because the nucleus contributes to the elastic scat- 
tering, and the orthogonality of initial and final wave functions means 
that only the much more diffuse electron cloud is effective in the inelastic 
case. 

Total cross sections are more easily obtained by integrating over q 
than over 6. This is accomplished by making use of the second of Eqs. 
(38.29) to replace sin 6 d0 by qdq/kakg. The result for elastic scattering 
in the high-energy case is 77/3k.*, which is about five times as large as 
the 1S — 28 total cross section, ($)!9(1281r/5k,”). Excitation to the 
states that have n = 2,1 = 1 (1S — 2P transitions) is most easily cal- 
culated by choosing the three final states (m = 0, +1) with their polar 
axis along the momentum transfer vector gq. Then the factors e+** that 
appear in the wave functions for m = +1 make these matrix elements 
vanish, and only the state with m = 0 is excited. This corresponds 
physically to the inability of the incident electron, whose momentum 
loss is along q, to exert a torque on the atomic electron about this axis. 
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The high-energy total cross section for this process turns out to be 
(3)12(576a/ka?) In 4keao. The appearance of the logarithm derives from 
an extra factor 1/q? in the differential cross section. Thus, in comparison 
with the LS — 2S scattering, the 1S — 2P differential scattering is more 
pronounced at small angles and the total scattering decreases less rapidly 
with increasing energy at high energies. 


PRODUCTION OF A CLOUD CHAMBER TRACK 


It seems surprising at first that a fast electron, which we can assume 
possesses a definite momentum (magnitude and direction) and hence 
cannot be localized in space, can produce a sharp track in a cloud chamber. 
This phenomenon may be considered from various points of view. In 
accordance with Ehrenfest’s theorem (Sec. 7), we can represent the elec- 
tron by a wave packet whose center of gravity moves like a classical 
particle. If the wavelength is short enough, the packet can be fairly 
small without spreading rapidly and will then interact only with atoms 
that lie close to the path of its center. This implies that the electron is 
represented by a superposition of plane waves and hence has an uncer- 
tainty in its momentum that enables its position to be sufficiently well 
defined. 

Another approach consists in describing the electron by a single 
plane wave and regarding its interaction with the first atom that it 
excites or ionizes as a position measurement that carries with it an 
uncertainty of the order of the atomic size. Thereafter, the electron 
is represented by a packet, like that described in the preceding paragraph, 
which is well localized if the first atom is large in comparison with the 
wavelength (ka > 1). 

We consider here in detail still a third description in which the 
electron and the atoms of the cloud chamber gas are treated as parts 
of a single system, so that we do not have to regard an atomic interaction 
as a position determination that changes the structure of the electron’s . 
wave function. To simplify matters, we assume that there are just two 
atoms present in their ground states and that their nuclei are far apart’ 
from each other and are fixed in space. We then calculate the cross 
section for a process in which both atoms are excited and the electron 
is scattered inelastically. For a fast incident electron, perturbation 
theory can be used; however, since the interactions of the electron with 
the two atoms are distinct, the process does not take place in first order, 
and it is necessary to go to second order. The calculation is interesting 
both because of the answer obtained and because it provides an instruc- 
tive example of second-order perturbation theory.! 


1W. Heisenberg, ‘‘The Physical Principles of the Quantum Theory,’”’ pp. 66-76 
(University of Chicago Press, Chicago, 1930). 
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The result of the calculation is that the cross section is very small 
unless the momentum vector of the incident electron is nearly parallel 
to the line that joins the two nuclei, and also unless. the initial and final 
electron momenta are nearly parallel. These three directions can have 
an angular spread in radians that is of the order of the ratio of the wave- 
length of the electron to the size of the atom. This is analogous to the 
result obtained in the preceding subsection for the inelastic collision of 
a fast electron with a hydrogen atom: The angular spread of the scattered 
electron was found to be roughly 1/kea@o. It is also in agreement with 
the wave-packet description of the process, since a localization of the 
electron by an atomic size a in a direction transverse to its motion pro- 
duces an uncertainty in the transverse-momentum component of amount 
h/a and an angular spread of order h/ap ~ 1/ka. 


SECOND-ORDER PERTURBATION THEORY 


The first-order perturbation or Born approximation for the T matrix 
element (38.21) was obtained by replacing xaat by vaa- In similar fashion, 
the second-order approximation is obtained by replacing xaat by the 
second term of the perturbation series that is analogous to Eq. (37.15), 
to obtain 


. , , i , 
j i il il if Uh, (14,03) A (£4,02)@ouca(01,42 01,82) H’ (41,82) 
Uaa(T1,¥2) dry d?r, d®r, d®re 


Substitution for the propagator can be made as in the first line of Eq. 
(38.20), where now the summations are over y and c rather than over 6 
and b. We thus obtain for the second-order contribution to the T matrix 
element? 


SySe(Eae — Eye + te)—(8b|H’|yc)(-ye|H’ aa) (38.30) 


This expression may be thought of as describing a two-step process 
in which the system makes a transition from the initial state aa to all 
possible intermediate states yc under the influence of the perturbation 
H’, and then a similar transition from yc to the final state 6b. Energy 
is conserved between initial and final states but need not be for the 
intermediate states. These have only a transient existence, and accord- 
ing to the uncertainty relation (3.3) it is impossible to determine the 
energy of such short-lived states with any precision. Thus it is not 
’ surprising that their contributions to the second-order T matrix element 
are inversely proportional to this energy discrepancy. Similar second- 


1 This was called ‘‘Golden Rule No. 1” by E. Fermi, ‘Nuclear Physics,” p. 148 
(University of Chicago Press, Chicago, 1950). 
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order matrix elements appeared in Eqs. (31.20) in connection with the 
perturbation of discrete energy levels. 

The notation underlying (38.30) must be generalized before it can 
be applied to the present situation. The hamiltonian for the electron 
and the two well-separated atoms is 


H = H,+ H’ 
2 
Ho a iy + Ay,(1) + H,(2) 


H’(r,1,2) = Hi(r,1) + H4(r,2) 


Here r is the electron coordinate, H, is the hamiltonian for the first atom 
by itself, 1 denotes all the internal coordinates of that atom, and H; is 
the interaction between the electron and the first atom; the second atom 
is described in similar fashion. The expression (38.30) may then be 
written 


$755, 9: Basa: = Five ss te) (Bb 1b2| H" |-yeice)(ye1C2| HH |aaiae) 


It is apparent that there are two possible groups of intermediate states 
7C1C2: that in which the first atom has made a transition from state a1 
to state b, under the influence of H; (c, = 61) while the second atom has 
not changed its state (ce = a2), and that in which c, = a, while the 
second atom has changed its state from a. to 62 under the influence 
of Hj (co = be). Thus the second-order matrix element may be written | 


S7(Eao, — Ey, + te)—\(Bb2|H3|va2)(ybi| Hj |aa1) 
+ S1(Eea, — Ey, + te)- 1(8b;|H|ya1)(ybo|H5laaz) (38.31) 


EVALUATION OF THE SECOND-ORDER MATRIX ELEMENT 


We now evaluate the first summation in (38.31) explicitly and later indi- 
cate the changes that are to be made in the result to obtain the second 
summation. The factors that appear there may be written in analogy 
with (38.24) and (38.20): 


(ybi|Hj\aa1) = |C|2ff exp (—7ak,-r + tk. - 1) 
 Hy(r,1) we (lL) we, (1) d?r dri 


(8b2|Ho|ya2) = |C|?ff exp (—ikp- 1’ + ik, - 1’) 
x HY(r’,2) wz, (2)wa,(2) d8r’ dre 


(38.32) 


2 
Eaa, — Ey, + te = = (x? faa i? + te) 
2m 


e=k,? — ar €a,) 
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Then, in analogy with the derivation of Eq. (37.18),. the summation over 
vy can be carried out to give 


— mit 


pe ff Ho — £1 exp (Gale! — x] ~ dheg e+ they +2) dr! dtr 
X Swe (2) Ho (t’,2)wa,(2) drofwe(1)H4(r,L)we,(1) dri (38.33) 


The nucleus of the first atom can, without loss of generality, be 
placed at the origin, and that of the second atom at the point R. Then 
the integral over the internal coordinates of the first atom will be very 
small unless r is close to zero, and the corresponding integral for the 
second atom will be very small unless r’ is close to R. We therefore 
write 


F,(r) = SwF (1) Hi (2,1) wa, (1) dri ; 
F2(r’ — R) = fug(2)H3(0’,2) wa,(2) dre 


These F’s are very small except when their arguments differ from zero 
by distances of the order of the size of the atom. We putr”’ =r’ — R, 
so that practically all the contribution to (38.33) comes from small values 
of randr’’. We can then obtain the leading term of (38.33) for large R 
by approximating 


Rr” _Rer 
R 


[ro — rl = |(R+r’—rl = R+ jr’ — 2|-1 = RO 


It is convenient to define a vector « that has the magnitude « and the 
direction of R. Then substitution into (38.33) gives 


- a exp [i(% — i) -R] f Fa(e”) exp [i(% — kp) - 2] d*r” 


x [Fm exp [7(k. — x) +r] dr (38.34) 


In similar fashion, the second summation in (38.31) becomes 


OR Fe exP (ele! + Ka) +R] f Fale”) exp [—a(w! + hep) 1] d2r” 


x if Fyx(r) exp [t(ka + «’) +> r] d’r (88.35) 


where x’ is a vector in the direction of R whose magnitude is given by 
the last of Eqs. (38.32) with «, — ea, replaced by 6, — €a,. 

The differential cross section is obtained by substituting the sum 
of (38.34) and (38.35) in place of the 7’ matrix element in the scattering 
amplitude (38.21) and then into Eq. (38.23). Conservation of energy 
requires that kg? = k,? — (2m/h?)(e, + &, — €a, — €a,)- 
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DISCUSSION OF THE CROSS SECTION 


The integrals that appear in (38.34) and (38.35) have the characteristic 
structure associated with the perturbation treatment of collision problems. 
They are very small unless the propagation vector that appears in the 
exponent of the integrand has a magnitude that is of order 1/a or less, 
where a is a typical linear dimension of the atom (F significantly different 
from zero). It follows that (38.34) is appreciable only when the vectors 
k,, «x, and kg are nearly equal in magnitude and direction. Because of the 
assumption that the incident electron is fast, the magnitudes are very 
nearly equal in any event; also, the direction of x is the same as that of R. 
Thus the cross section that arises from (38.34) is appreciable only when 
the vectors R and kg are nearly parallel to x.. The permitted angular 
deviation from parallelism is easily seen to be of order 1/k.a. 

Similarly, it follows that (38.35) is appreciable only when x’, and 
hence R, is nearly antiparallel to both k, and kg, in which case the latter 
two vectors are nearly parallel to each other. 

The two terms together show that excitation of both atoms occurs 
with appreciable probability only when the line joining the two atoms is 
nearly parallel to the direction of the incident electron. There is, how- 
ever, no further restriction on the location of the atoms, since the choice of 
origin of the coordinate system is arbitrary. Thus, although the cloud 
chamber track has a well-defined orientation in space, it may appear 
anywhere if the incident electron is described by a plane wave. It is 
apparent also that the cross section falls off inversely as the square of the 
distance R between the two atoms, as would be expected. __ 


EIKONAL APPROXIMATION 


The perturbation methods developed earlier in this section replace x.+ in 
the 7 matrix element (37.2) by the first one or two terms on the right side 
of Eq. (37.15). A quite different approximation for x.+ can also be found 
along the lines of the WKB approximation, which was discussed in Sec. 34; 
substitution into (37.2) then yields the eikonal approximation.' 

The WKB approximation starts from the exact equations (34.2) and 
expands S(r), the logarithm of the wave function, in powers of / in the one- 
dimensional case. The leading term, So, approximates the phase of the 
wave function, and the next term, Si, approximates the magnitude. A 
rough validity criterion was obtained in (34.9); in three dimensions it 
corresponds to the requirement that 


Kl kG) = +5 (2B. — VI}! 


1G. Moliére, Z. Naturforsch. 2, 133 (1947). The earlier history of this approach, in 
connection with optics and classical mechanics, is reviewed by M. Born and E. Wolf, 
“Principles of Optics,” pp. 111, 132 (Pergamon Press, New York, 1959). 


if 
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Physically, this means that the potential energy changes so slowly that 
the local momentum Ak(r) is sensibly constant over many wavelengths. 

For the present application, we shall assume in addition that S,; can 
be neglected in comparison with Sy and that V(r) is everywhere small in 
comparison with E,. We thus have a high-energy approximation in 
which V(r) does not vary rapidly. It might therefore be thought at first 
that the end result is equivalent to the Born approximation. However, 
Eq. (38.4) shows that, at high energies, | (hv)? ie V(r) dr | must be 
small in comparison with unity in order for the Born approximation to be 
valid. This parameter can be larger than unity for a weak potential of 
long range even though V < E,; the eikonal approximation does not 
limit its value and so is superior to the Born approximation. 

We therefore put 


xet(t) ~ C exp) (wg) = QulB, — Vix)] = WEAR) 


h 
(38.36) 


The solution of this differential equation for So when V can be neglected is 
So(r) = hk,-1r, where k, is any vector with magnitude k, = (2uE./h?)}. 
In this case xt is simply u., as would be expected, and the Born approxi- 
mation is obtained. It is convenient to choose the positive z axis to lie 
along the direction of k., so that So(r) = kz when V can be neglected. 

When VV is finite but small in comparison with E,, we try the follow- 
ing approximate solution of (38.36): 


So(t) ~ kez — + [i Veeye) ae! (38.37) 


where the relative velocity is ve = hk./u and 29 is a constant to be deter- 
mined shortly; the integral is assumed to converge for allr and Zz. Sub- 
stitution of (38.37) into the second of Eqs. (38.36) shows that the latter is 
satisfied apart from terms of order n.V?/E,; these may be neglected in 
comparison with 2uV since V << E,. We thus obtain 


Xat(r) = C exp (i E _ a fe V (2,y,2’) «|| (38.38) 

Within the limitations of the approximations introduced above, 
(38.38) is a solution of the wave equation but is not necessarily of the out- 
going type. We now show that x.+ is obtained according as 2o is chosen 
to be #0. Tosee this, we first write (38.38) in the form of a plane plus 
a scattered wave: 


Ua(r) + Cetka* {exp |- i f. V (x,y,2') a | — | 
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This scattered wave vanishes outside of a tube whose diameter is that of 
the potential and which is centered on thez axis. If we choose z = — ~, 
the asymptotic scattered wave also vanishes in the negative-z half of the 
tube and is a constant multiple of e«* in the positive-z half so that it 
represents an outgoing wave. Similarly, if we choose 2 = +, the 
asymptotic scattered wave vanishes in the positive-z half of the tube and 
is an ingoing wave in the negative-z half. 

As remarked above, the eikonal approximation goes beyond the 
Born approximation in that the phase difference between x.t and wa is 
taken into account in lowest order at high energies. This shows that the 
Born approximation is valid at high energies so long as this phase differ- 
ence is small in comparison with unity: 


[ iv.) ies V(2,y,2’) de’ | «1 for all r (38.39) 


This validity criterion is more general than that inferred from (38.4) since 
it is applicable when V(r) is not spherically symmetric. The use of xat 
in place of uz corresponds to neglect of the change in direction of the 
velocity of the particle as it moves through the potential and to inclusion 
of the effect of change in velocity magnitude insofar as it affects the phase 
but not the amplitude of the wave function. 


SCATTERING AMPLITUDE AND CROSS SECTION 
Substitution of (38.38) into (37.20) gives for the scattering amplitude 


Miko ita) ~~ gh [ Veedexp {i[a-e — ga, f*, Vewe) ae |] a 


It can be shown that this expression is valid only so long as the scattering 
angle @ is small in comparison with (k.a)—*, where a is the range of the 
potential.!. For this range of 6, q- r can be replaced by g.t + qyy; this is 
because g,% and q,y are <k.a8, which may be much larger than unity, 
whereas q.2 < k,a6?, which is much smaller than unity. The integration 
over z can then be performed and leads to 


f (Kp, ka) iad Bs f. ee et(gett+ayy) 


{1 — exp |- Aa me V(x,y,2) az|| dz dy (38.40) 


1L. I. Schiff, Phys. Rev. 108, 443 (1956). Similar expressions that are valid for larger 
angles are derived in this paper and more comprehensively by D. 8. Saxon and L. I. 
Schiff, Nuovo Cimento 6, 614 (1957). For a detailed review of the small-angle case 
and derivation of additional results, see R. J. Glauber, in W. E. Brittin and L. G. 
Dunham (eds.), ‘Lectures in Theoretical Physics,” vol. I, p. 315 (Interscience, New 
York, 1959). 
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The total cross section can be obtained from (88.40) with the help of the 
optical theorem (20.13): 


aioe = 5 Tm [f(lteske)] ~ 2 ff 


exp | - i = V1(2,y,2) az| cos E f. V r(x,y,2) az|} dx dy 
(38.41) 


As in Sec. 20, we have put V = Vr — 2V1, where Vz is real and V, is real 
and positive. 

It is interesting to see if this expression for the total cross section is 
consistent with plausible estimates for the total elastic cross section o. and 
for the sum of the total absorption and inelastic cross sections o.p,. It is 
reasonable to suppose that, even though (38.40) is not valid at large 
angles, the amplitude gets very small there, and most of the contribution 
to a comes from small angles. We can then put gz ~ k,8 cos ¢, qy ~ 
k,8 sin ¢, and replace the solid angle element as follows: sin 6 d0d¢ = 
6d0d¢ = dqzdq,/ka?. In the interest of simplicity, we approximate 
further by extending the limits on g, and gy to +o. We thus obtain 


1 - - 
oa = pa ff, Lio ke)? dae ay 


ow i ee (1 — 2exp [- mf. Vir(x,y,2) az| 


) 2 ro) 
X cos E on Var(z,y,2) az| + exp [- om f. Vr(2,y,2) a|| dx dy 
(38.42) 


where use has been made of Eq. (11.10). In similar fashion, Eq. (20.3) 
can be used to calculate o,,, if we substitute for y from Eq. (38.38) with 
C = land z = —o: 


coe ~ ef, ff, Views) 
exp | - ie if V1(2,y,2’) a | dx dy dz 


= be f-. {2 — exp |- — i V1(x,y,2) az|| dx dy 


(38.43) 


It is evident that Eqs. (38.41) to (38.43) are indeed consistent with each 
other. 

All equations (38.40) to (38.43) have the form of integrals over the 
_ transverse coordinates x and y. Each 2, y pair may be thought of as 
defining a classical trajectory; to our approximation it is a straight line 
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parallel to the z axis, with ¢mpact parameter b = (x? + y’)!, since, as 
remarked at the end of the preceding subsection, the change in direction 
of the particle velocity as it moves through the potential has been neg- 
lected. The scattering amplitude and cross sections are then sums of 
contributions from all possible impact parameters. In the case in which 


V is symmetric about the z axis, i) / oe dx dy can be replaced by 
2r if b db; for example, (38.40) then becomes 


J(0) = ike f,° Jo (go) {1 - exp] = 55 [7 v2) ae]! bat 
: q = kad (38.44) 


where J is the ordinary Bessel function of order zero. 


PERFECT ABSORBER 
A simple example of the foregoing results is provided by a perfect absorber: 
an object for which, even though V; < Ea, (hva)=! J ae V1(x,y,z) dz is so 


large that the exponential is negligibly small compared with unity for all 
trajectories z, y that pass through the object. In this case it follows from 
Eas. (38.41) to (38.43) that 


Tel ~ Tabs ~ A Ttor ~ 2A (38.45) 


where A is the cross-sectional area of the object. This result is closely 
related to that noted in connection with Eq. (19.24). In that case there 
was no absorption, but the elastic scattering could be divided into equal 
contributions za? arising from spherically symmetric scattering by the 
rigid sphere of radius a and from a diffraction peak in the forward direc- 
tion. In the present case there is no large-angle scattering, but the elastic 
diffraction scattering is equal to the absorption. 

The angular dependence of the diffraction scattering is easily 
obtained if the absorber is a sphere of radius a. Equation (38.44) then 
becomes 


f(0) = tka ip * Jolkab6)b db = “ J1(kot8) 


The differential elastic scattering cross section |f(@)|? has a peak at 6 = 0 
with height +k,2a4 and angular breadth of order 1/k.a. The total elastic 


cross section may again be estimated by extending the upper limit on @ to . 


2, to obtain 
2 ° ms d 
Ga © 20 f |f(0)|20 d@ = 2na? if J (2) = = ra* 


in agreement with (38.45). 
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39D ANALYTIC PROPERTIES AND DISPERSION RELATIONS 


Stationary collision theory, developed and applied in the two preceding 
sections, deals with elements of the scattering and transition matrices for 
particular values of the energy. Although most of the discussion was 
concerned with special situations, it was pointed out that useful general 
results could be obtained from the assumed symmetry and unitarity of 
the S matrix. General results of a different kind can be obtained from the 
assumptions that the S and T matrix elements are analytic functions of 
the energy and of other physical parameters, such as scattering angle and 
angular momentum, when these parameters are regarded as complex 
variables. These results are expressed in the form of dispersion relations. 
A typical disperison relation connects 7’ matrix elements at different 
physical values of the energy, that is, at real rather than complex energies. 
The assumed analytic dependence of the 7 matrix element on the energy 
even when the energy is complex (and hence unphysical) provides a bridge 
between physical values by way of the complex-energy plane. 

Thus a dispersion relation can be derived only after the analytic 
behavior of the T matrix element has been established or conjectured, so 
that the locations of poles and branch cuts and the asymptotic dependence 
are believed to be known. As originally derived in optics by Kronig and 
Kramers,! the analytic behavior underlying the dispersion relation was 
inferred from causality: the statement that a light signal has a limiting 
speed c, so that the scattered electromagnetic radiation cannot outrun the 
incident wave. Causality connects events that occur at different times and 
hence relates Fourier components of the electromagnetic field that corre- 
spond to different frequencies. But in nonrelativistic quantum mechan- 
ics there is no limiting speed and hence no causality of this kind, so that it 
might at first be thought that there are no dispersion relations. However, 
solutions of the Schrédinger equation that correspond to different values 
of the energy or the angular momentum are in fact connected by the 
assumption that the potential energy is independent of these parameters 
(or in more complicated situations has a specified dependence on them). 

In this section we first consider the analytic properties of a diagonal 
element of the S matrix in the angular momentum representation when 
the potential energy is spherically symmetric. From Eq. (37.36), this is 
equal to e?:), where the /th phase shift depends on the energy or on k. 
Some useful results, including dispersion relations, can be derived once the 
analytic behavior is established. We then obtain a dispersion relation 
for the forward scattering amplitude, which is also valid when the poten- 
tial is spherically symmetric. Through the optical theorem (Eq. (20.23)], 


1See, for example, R. G. Newton, “Scattering Theory of Waves and Particles,” 
sec. 4.2 (McGraw-Hill, New York, 1966). 
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this relates the forward scattering differential cross section (which requires 
_ knowledge of both the real and imaginary parts of the forward amplitude) 
to an integral over the energy of the total cross section. 

No attempt is made at mathematical rigor, and some needed results 
are stated without proof. More detailed derivations are given in the 
references cited below and in the original papers to which they in turn 
refer. 


RADIAL SOLUTIONS! 


The regular solution of the radical wave equation (19.2) can be specified 
in terms of its boundary condition near the origin: 


pith 


oi(k,r) =a @+ pn . (39.1) 


¢:(k,r) is an even function of k since it depends on k only through k?, and it 
is real when k is real since V(r) is assumed to be real. We work with 
¢:(k,r), rather than with the proportional solution rR,(r) that was intro- 
duced in Sec.:19, because the boundary condition (39.1) is independent of 
k. This means that use can be made of a theorem of Poincaré, which 
applies to the solution of an ordinary differential equation like (19.2) with 
the property that the dependence of the equation on some parameter k is 
only through an entire function? of k, in this case k?. According to this 
theorem, a solution specified by a boundary condition that is independent 
of k is itself an entire function of k. Thus, for fixed r, ¢,(k,r) is an entire 
function of k whereas rR,(r) need not be. 

¢i(k,r) is a standing wave and has the asymptotic form of a sinusoi- 
dal function of kr. Since it is convenient to have traveling wave solutions 
of (19.2) available as well, we define another solution f;(k,r) through an 
asymptotic boundary condition: 


filk,r) > ve (39.2) 


This is evidently irregular at the origin, where it is proportional to rn;(kr) 
and hence behaves like7~'. [For! = 0, we follow the procedure of Sec. 15 


1 For a fuller account of the analytic properties of partial wave amplitudes, see R. G. 
Newton, J. Math. Phys. 1, 319 (1960) and “‘Scattering Theory of Waves and Particles,” 
chap. 12 (McGraw-Hill, New York, 1966); M. L. Goldberger and K. M. Watson, 
“Collision Theory,” sec. 6.5 (Wiley, New York, 1964); V. De Alfaro and T. Regge, 
“Potential Scattering,’ chaps. 4, 5 (Wiley, New York, 1965). The notation used in 
Newton’s book is slightly different from that followed here or in the other references 
cited. 

2 An entire function or an integral function of k is an analytic function of & that is 
regular for all finite values of k. 
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and regard the solution that behaves like rjo(kr) « r at the origin as regu- 
lar, and the solution that behaves like rno(kr) er constant as irregular. ] 


As remarked in Sec. 19, the asymptotic form (39.2) requires that 
V(r) fall off faster than 1/r for large r. It is also easy to see that the 
power dependence of (39.1) requires that V(r) become infinite less rapidly 
than 1/r? as r—0. We shall actually assume the more restrictive 
conditions 


ff lV@lar< » and [Priv@lar< © (39.3) 


throughout this section and occasionally limit the asymptotic behavior of 
V(r) even more severely. 

Except at k = 0, a third solution f;(—k,r) is linearly independent of 
fi(k,r) and has the asymptotic form 


fi(—k,r) =< deter (39.4) 


Thus ¢,(k,r) can be expressed as a linear combination of f;(+k,r). This 
corresponds to the procedure, followed in Sec. 19, of writing rR,(r) in the 
asymptotic region as a linear combination of e+". The ratio between the 
coefficients of these two terms is essentially the S matrix element (37.36): 


Si(k) = ei) . (39.5) 


Thus we can study the analytic properties of S:(k) by studying those of 
fi(k,r) and making use of the fact that ¢,(k,r) is an entire function of k. 
The analytic properties of f:(k,r) can be established by working with 
an integral equation that it satisfies, as is done in the references cited 
above. The conclusion is that, for fixed r > 0, fi(k,r) is an analytic func- 
tion of k that is regular in the open lower half of the complex & plane; it is 
also continuous onto the real axis except at k = 0. If the potential 
satisfies not only (39.3) but also the condition . 


if r|V(r)l|e" dr < @ np>O (39.6) 


the region of regularity also includes the strip in the upper half of the k 
plane for which Im k < gu, except for a pole of orderlatk = 0. Further, 
if V(r) vanishes asymptotically faster than any exponential, so that (39.6) 
is satisfied for arbitrarily large yu, then k’f,(k,r) is an entire function of k for 
all fixed r > 0. This will occur, for example, if V(r) falls off like a 
gaussian function for large r or vanishes identically for r greater than 
some finite value. 

It follows from (39.2), (39.4), and the reality of the differential 
equation (19.2) that 


[fil —kr)]* = (—1)'filk,r) (39.7) 
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forrealk. If now we move away from the real axis but stay in a region in 
which f,(k,r) is analytic, we note that ff satisfies the same equation with k* 
that f; satisfies with k. The boundary conditions (39.2) and (39.4) are 
then consistent with each other for complex k if we replace (39.7) with 


[fi(—k*,r)]* = (—1)'fi(k,r) (39.8) 


In similar fashion, the fact that ¢:(k,r) is an entire function of k that is 
real for real k implies, by Schwarz’s reflection principle, that 


[oi(k*,r)]* = dilk,r) (39.9) 
for all finite complex k. 


JOST FUNCTION 


As remarked just after (39.4), ¢:(k,r) can be expressed as a linear combina- 
tion of f;(+k,r). We write this combination in the form 


di(k,r) = gtk Nf (—k) filkyr) — (—1) fk) fi(—k,r)] (39.10) 


which is consistent with the requirement that ¢:(k,r) be an even function 
of k. The coefficient f,(k) is called the Jost function! and is closely related 
to Si(k). This relation is obtained by comparing the asymptotic form 
of (39.10), 


oulb) 2 5 (F) Wi —e* — (Der (39.11) 
with the asymptotic form of rR,(r) given by (19.8): 

rRi(r) — - t+ eitife—strat, (1 gir tb] (39.12) 
It follows from this comparison and the definition (39.5) that 


Si(k) = e%h® = io (39.13) 


As might be expected from the form of Eq. (39.13), the analytic properties 
of the S matrix element are somewhat more complicated than those of the 
Jost function; for example, S,(k) generally has poles both at the poles of 
fi(k) and at the zeros of f;(—k). 

The complex conjugate of Eq. (39.10) is 


[oi(k,r)]* = —ge(k*)— OU f(—k)filkyr) — (—I) Rk) fi(—k,r)]* 


On substituting k* for k throughout this equation and making use of 
1R. Jost, Helv. Phys. Acta 20, 256 (1947). 
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Eqs. (39.8) and (39.9), we find that 
i(—k*))* = file) , (39.14) 


For real k, Eqs. (39.13) and (39.14) show that |S,(k)| = 1, so that the S 
matrix is unitary and 6,(k) is real, as is expected for a real potential. It 
also follows that 


ilk) = |frlk) le a (39.15) 


for real k. 

We now wish to make use of our knowledge of the analytic properties 
of ¢:(k,r) and fi(k,r) to determine the analytic properties of fi(k). To 
accomplish this, we use Eq. (39.10), which defines f,(k), to obtain an 
explicit expression for f,(k) in terms of ¢:(k,r) and fi(k,r). We note that, 
for a differential equation of the form (19.2), the wronskian 


WLfCk,r) ea(kyr)] = fle) & bulksr) — ou(hsr) S Silks) 


of any two solutions of the equation with the same k is independent of r. 
Substitution from (39.10) gives 


Wiilkyr),d(k,r)] = #(—k) i(k) WLC, ful —k,r)] 


The wronskian on the right side can be evaluated for large r by using the 
asymptotic forms (39.2) and (39.4); it leads to 


filk) = kK Wifi(k,r), dr(k,r)] (39.16) 


The radial derivatives of f,(k,r) and ¢;(k,r) have the same analytic prop- 
erties with respect.to k as the function themselves. Then, since ¢,(k,r) is 
an entire function of k, Eq. (39.16) shows that f,(k) has the same analytic 
properties as k'f,(k,r). For example, if V(r) has a finite range, f:(k) is an 
entire function of k. 


ENHANCEMENT FACTOR 

Equation (39.15) shows that the argument of the complex Jost function is 
the scattering phase shift. It is of interest to note that the magnitude 
\f:(k)| also has a simple physical interpretation. Comparison of (39.11) 
and (39.12) shows that 


rRi(r) = oi(k,r) (39.17) 


hi RB) B) 

Thus, as r approaches zero, (39.1) shows that rRi(r) approaches 
r(kr)'/fi(—k)(2l + 1)!! whereas (15.7) shows that the free solution 7j:(kr) 
approaches r(kr)'/(21 + 1)!!. Thus the magnitude of the wave function 
in the neighborhood of the origin is multiplied by the factor 1/|f:(k)| on 
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account of the scattering potential. This enhancement factor is of impor- 
tance in computing the effect of interaction among particles in the final 
state of scattering or production processes.! 


JOST FUNCTION FOR LARGE || 

The asymptotic properties of f,(k) are established in the references cited 
earlier and will not be derived here. On the real axis and in the lower 
half of the complex k plane, 


fk) ——9 1 + Bs ai fe V(r) dr Im k <0 (39.18) 


i> = 
where it is assumed that the integral is finite. 
Since V(r) is real, Eqs. (39.15) and (39.18) show that for real k 


(a) > gl VOe (39.19) 


This may be compared with the Born approximation for the phase shift, 
given in Eq. (38.14). For fixed J in the limit of very large k, Eq. (15.8) 
shows that 7;2(kr) may be approximated by 1/2k?r? for all except the small- 
est values of r. It then follows that (38.14) and (39.19) are in agreement. 


BOUND STATES 


A bound state, if it exists, is described by a regular solution ¢:(k,r) for 
which k? is real and negative and the r dependence for large r is a decreas- 
ing exponential. This means that one of the two terms in the asymptotic 
expression (39.11) must be zero; it is easily seen that the same result is 
obtained in both cases. With & = +7, we find that a bound state is 
given by 


oi( + tx,7) > a(— Kn) ile" fi(—%x) = 0 ck >0 
(39.20) 


provided that k = % is in a region of analyticity of f,(k) that is connected 
with the real axis. The energy of such a state is —h?x?/2Qu. 

We conclude in this case that a zero of the Jost function on the 
negative imaginary k axis corresponds to a bound state. Further, fi(7x) 
cannot vanish, since in accordance with (39.20) this would cause ¢;(7x,r) 
to be identically zero; thus the S matrix element (39.13) has a zero at 
k = —%x and a pole at k = ix. It can be shown that each such zero and 
pole are simple. Although all negative imaginary zeros of the Jost func- 
tion give bound states, this is not true of all negative imaginary zeros or 


1 See, for example, J. Gillespie, ‘‘Final-State Interactions,’ p. 17 (Holden-Day, San 
Francisco, 1964). 
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positive imaginary poles of S;(k); an example of this will be given below in 
connection with the theory of the effective range. 


DISPERSION RELATIONS FOR THE JOST FUNCTION 


The function fi(k) — 1 is analytic for Imk < 0 if the potential satisfies 
the conditions (39.3) ; also, in accordance with (39.18), it is of order 1/k as 
|k| >» «©. We can therefore make use of Cauchy’s theorem to write 

; ’ 

Ak) -1=- 2 f MO“lw  Imk<o (39.21) 
where the contour C is shown in Fig. 36a. Since the integral over the 
semicircle vanishes in the limit in which its radius approaches infinity, the 
contour integral can be replaced by an integral over the entire real axis. 
A limit may then be taken in which k approaches the real axis from below, 
in accordance with Fig. 360. For a vanishingly small radius of the semi- 
circle, the integral along the real axis is the principal value, and the 
integral over the semicircle is —w7[fi(k) — 1]. Thus (39.21) becomes 


fi(k) -1= ip ft ie) — I ae : k real (39.22) 


Equations that relate the real and imaginary parts of f:(k) — 1 can 
be found by taking the real and imaginary parts of Eq. (39.22): 


i) , — 
Re [fi(k) — 1] = —+p [7 AOU) — Uy 
acre ee 
Im [fi(k) — 1] = “Pp | ee aoe ae 
Since they obey the relations (39.23), the real and imaginary parts of 


fi(k) — 1 are said to be Hilbert transforms of each other. The dispersion 
relations (39.23) can be used as the basis of an approximate calculation, 


k’ plane 


(a) (6) 


Fig. 36 Contours for the evaluation of Eq. (39.21). 
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by starting with the Born expression (38.14) for the phase shift. In 
accordance with the discussion below Eq. (39.17), fi(k) = 1 in the absence 
of a scattering potential, so that Im [f:(k) — 1] = dai(k) + 0(V?). This 
can be substituted into the first of Eqs. (39.23) to obtain an approximate 
expression for Re [f:(k) — 1]. Then, from (39.15), the ratio of the imagi- 
nary to the real part of f:(k) gives an improved expression for tan 6:(k).} 


DISPERSION RELATION FOR In /;(k) 

Another dispersion relation can be obtained by starting with In fi(k) 
instead of fi(k) — 1 in Eq. (89.21). If we assume for the moment that 
there are no bound states, then In f;(k) is also analytic in the lower half 
plane and is of order 1/k as |k| > ©. From (39.15), we have that 
In fi(k) = In |fi(k)| + 78:(k), where 6,(k) is assumed to be defined so that 
it vanishes at k = +, in agreement with (38.14) and (39.19). Then 
the first of Eqs. (39.23) becomes 


In [fk] = — 2p f* O a 


We add 78:(k) to both sides of this equation; then the left side is In f;(k), 
and 76:(k) can be replaced on the right side by 


pe 1 5:(k’) , 
= I Nn dk 


(39.24) 


"k’ —k 
where the contour C’ is the small semicircle in Fig. 36. We thus obtain 
st 5(k’) 1, 
In fit) = —= [,,, dk 


where C”’ is the full contour shown in Fig. 36b. This can also be written 
as an integral along the entire real axis: 


Inf) = — Ff pre 


where the limit e — 0+ is understood. 


dk’ (39.25) 


EFFECT OF BOUND STATES 

Thus far we have assumed that there are no bound states. If there are 
bound states with energies E,, < 0, they produce zeros of f,(k) at k = —tkn, 
where kn = +(—2yE,/h?)!. These zeros of f:(k) give rise to branch points 
of In fi(k), and the foregoing analysis fails. We can, however, define a 
reduced Jost function 


fulkt) = Hulk) rs AL 


1J. J. Giambiagi and T. W. B. Kibble, Ann. Phys. (N.Y.) 7, 39 (1959). 


(39.26) 
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such that In f,(k) has the same analytic and asymptotic. behavior that 
In f:(k) has in the absence of bound states. In places of (39.15), we now 
have that fi(k) = |f,(k)|e*%™, where |f,(k)| = |f:(k)| and 


5(k) = 6:(k) + id E (i+! + =) -1 n(a Bs *)| (39.27) 


Thus the dispersion relation (39.25) is valid when bars are placed over 
ti and 61. 

We therefore substitute (39.27) into the integrand of (39.25). Each 
of the first group of logarithms introduces a branch point at k’ = —7, and 
at k’ = 0, which can be connected by acut. Thus the integral along the 
real axis can be closed by an infinite semicircle in the upper half plane, 
which contributes nothing. The branch point at k’ = 0 causes no diffi- 
culty with the integration along the real axis, since this integration can be 
regarded as a principal value at k’ = 0 right from the beginning, and the 
logarithm gives no contribution from the infinitesimal neighborhood of 
the origin. It follows that each of the first group of logarithms integrates 
to zero since the pole at k’ = k — ze lies outside the contour. The branch 
points for the second group of logarithms are at zx, and 0, so that the con- 
tour is closed by an infinite semicircle in the lower half plane and contains 
the pole at k — 1. 

We conclude, then, that Eq. (39.25), with bars placed over f; and 61, 
reads as follows: 


1 fe 5:(k’) ; 1Kn 
In fuk) = —= f° pe eer +2) In 1-= 
Substitution for f, in terms of f; from (39.26) gives! 


In fuk) = 2 f°) ae + > In (+8 2 (39.28) 


The corresponding a. that is suinlowous to (39.24) is 


in (fi) Pf; i(k") dk! +> n( ie ) (39.29) 


Equation (39.28) makes it possible to calculate f:(k), and hence also.S,(k), 
in terms of the scattering phase shift and bound-state energies —h?x,?/2y; 
similarly, the enhancement factor 1/|fi(k)| can be calculated from 
(39.29). 


1R. Jost and W. Kohn, Phys. Rev. 87, 977 (1952); R. G. Newton, J. Math. Phys. 1, 
319 (1960), Sec. 5. 
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In using these equations, we note from (39.14) and (39.15) that, 
for real k, 


eh) = eh) if fi(k) # 0 (39.30) 


Now it is convenient to continue our earlier convention that 5:(/) vanishes 
atk = +; this is in agreement with Eqs. (38.14) and (39.19). It then 
follows from ‘(39. 30) that 6(—k) = —6,(k) forrealk. However, this does 
not necessarily imply that 6:(0) = 0, since the limiting values of 6:(k) as k 
approaches zero from the two sides need not be the same. We therefore 
extend our convention as follows: 


6(—k) = —6:(k) for real k # 0 51(k) aa 0 (39.31) 


LEVINSON’S THEOREM 


There is an interesting relation between the number 7, of bound states of 
given / and the corresponding phase shift 5,(0) at zero energy, defined as 
the limit from positive k. We assume for the moment that f,(0) ~ 0 and 
form the contour integral 


> ii ae dk = — 97 af din fi(k)| (39.32) 


where the contour C is that shown in Fig. 36a. The integrand has simple 
poles of unit strength at the zeros of f;(k), each of which corresponds to a 
bound state. Thus the integral on the left side of (39.32) is equal to m. 
As before, we can put In f,(k) = In |f:(k)| + 75:(k); fi(k) changes continu- 
ously around the contour and is never zero on it, so that In |f:(k)| is con- 
tinuous and the contour integral of this term on the right side is zero. In 
accordance with (39.31), 6:(k) changes continuously from 6,(0) at one end 
of the contour just to the right of the origin to — 6,(0) at the other end of 
the contour just to the left of the origin, so that the integral is equal to 
—2i5,(0). We thus obtain a portion of Levinson’s theorem:' 


6:(0) = rm fi(0) #0 (39.33) 


For the exceptional situation in which f,(0) = 0, the contour must be 
indented with a semicircle of infinitesimal radius ¢ in the lower half plane. 
It is still true that the real part of the integrand, In |f:(k)|, contributes 
nothing to the integral. There is, however, an additional contribution 
from the integral of the imaginary part around the infinitesimal semi- 
circle. In order to calculate this, we must know how fi(k) approaches 
zeroask— 0. Itcan be shown that, if f,(0) = 0, then fi(k) ~ ak?, where 
q = lforl = Oandg = 2forl > 0. Ifweputk = ce on the semicircle, 
where ¢ goes from 7 to 2z, the imaginary part of In fi(k) is qd, and the 


1N. Levinson, Kgl. Danske Videnskab. Selskab, Mat.-Fys. Medd. 26, (9) (1949). 
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integral over the semicircle is trg. Thus the complete integral (39.32) 
with the indented contour is equal to 


1 
~ 5(0) — 39 (39.34) 


With] = 0, the zero of fo(k) at k = 0 does not correspond to a bound 
state. The situation here is similar to the resonance scattering case dis- 
cussed in Secs. 19 and 34: In the absence of a centrifugal barrier (1 = 0), 
and with a potential that obeys (39.3) (so that V falls off faster than 1/r), 
the wave function “leaks” out. Then no is equal to (39.34) with q = 1, 
so that 


50(0) = w(no + 4) fo(0) = 0 | (39.35) 
With 1 > 0, the zero of fi(k) at k = 0 does correspond to a bound state 
since, as shown in Sec. 34, the centrifugal barrier prevents a zero-energy 
wave function from leaking out. Then, since this state has been excluded 
by the indented contour, 7 is greater by one unit than (39.34) with q = 2, 
or 7; is equal to 6,(0)/r. This is the same as (39.33), and so we can write 
both cases, together with (39.35), as 

5:(0) = r(m: + 4) ifl=0 and fi(0) = 0 


39.36 
5(0) = rn otherwise ( 


Equations (39.36) are the full statement of Levinson’s theorem. 


EFFECTIVE RANGE 

As a simple explicit example of a Jost function, we now consider 
k + i 

k — ta 


fo(k) = (39.37) 
where «x and a are real and positive. [We neglect any deviation of fi(k) 
from unity for > 0.] Note that this fo(k) has the correct asymptotic 
behavior (39.18), that it has a simple zero at k = —7zx which corresponds 
to a bound state of energy —f?x?/2u, and that it is analytic except for a 
simple pole at k = ia. Equation (39.37) can also be written 


2 2\4 
falt) = (FF) ome 


= -1* = hes -1 
do(k) = tan k + tan i tan 


eae (39.38) 


k? — xa 


Low-energy scattering information, especially in nuclear physics, is 
often presented as a series expansion for k cot 59(k) in powers of k?: 


Ecates (ky = —* nae ee are (39.39) 
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where a is called the scattering length and ro the effective range. It follows 
from the second of Eqs. (39.38) that 


Ka k? 


k+ea k+a 


k cot do(k) = — 


(39.40) 


so that the effective-range formula describes the scattering exactly in this 
case.! Comparison of (39.39) and (39.40) gives 


20 
kK+a 


In order for (39.37) to describe the experimental situation, it is necessary 
that the three experimentally determinable parameters, a, 7o, and x, 
satisfy the equation obtained from (39.41) by elimination of a: 


be 
a 


ro = a= bs + 2 (39.41) 
K a 


= x — Arox? (39.42) 


Equation (39.42) is in fact well satisfied for neutron-proton scattering in 
the triplet spin state, in which case x corresponds to the bound state of 
the deuteron (see Sec. 50). 

The S matrix element that corresponds to (39.37) is, from (39.13), 


Sa(k) = BL iG + tad) 


(k — ix)(k — ta) (39.43) 


It evidently contains less information than the Jost function, since there 
is no way of telling from So(k) alone whether the bound state corresporids 
to x or to a. This provides an explicit example of the remarks in the 
paragraph below Eq. (39.20). 

Effective-range theory can be used as well when there is no bound 
state. Consider the situation when a remains fixed while « decreases to 
zero and then becomes negative. When « = 0, (39.37) shows that 
fo(0) = 0, so that there is a zero-energy resonance; also, (39.41) shows 
that the scattering length becomes infinite. For « small and negative, 
the scattering length is negative, and the parameters can be chosen to 
describe low-energy neutron-proton scattering in the singlet spin state.. 

For all three of these cases, the phase shift 5o(k) at first increases 
from zero as k decreases from infinity. With a bound state, it increases 
monotonically to x atk = 0, and with a zero-energy resonance it increases 
to 47. When « is negative, the phase shift passes through a maximum 
and then decreases to zero at k = 0. All this behavior is in agreement 
with Levinson’s theorem (39.36). 


1V. Bargmann, Rev. Mod. Phys. 21, 488 (1949); R. G. Newton, J. Math. Phys. 1, 
319 (1960), Sec. 10(f). 
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FORWARD SCATTERING AMPLITUDE 


We now derive a dispersion relation for the forward scattering amplitude, 
still assuming that the potential is spherically symmetric.! It is con- 
venient to work with the 7 matrix element (37.2), which is related to 
the amplitude by (37.21). We define 


T (Ea) = |C\-XalT\a) = |Cl? f ud(2)V(r)xa*(e) dr 


= — 72 fake) (89.44) 


Substitution from (37.17) gives 
T(Ea) = |Cl*[fua(e) V(r) uae) d3r 
+ ho ffuz2)V(r')Go.t(r' 0) V(r) ua(r) d3r’ dr] 
We thus obtain 
T(E) = T3 + aif f exp (—7k- r’) 
V(r')G.t(r',r) V(r) exp (tk +r) d’r’ d’r = (39.45) 
= fV(r) dr 


where the Born matrix element 7's is independent of the energy and the 
subscript a has been dropped. 

The Green’s function that appears in (39.45) is defined in Eq. (37.8). 
We adopt 6-function normalization for the continuum and break up the 
generalized summation S, in (37.8) into a sum over bound states n and 
an integral over continuum states a: 

os , ae Xn a(t xa) pa 2 Xa _Xa(t’)xa (4) Xe (r) 

AG, +(r',r) = + [da Ae Xe. area (39.46) 
The x, are normalized to unity, and ze can be dropped in this sum since 
the E, are negative. The normalization of the continuum states is such 
that if x.(r) has the asympototic form exp (7k-r), the differential da 
becomes d?k/(2:)*. 

Equation (39.46) shows that G,,+, regarded as a function of F asa 
complex variable, is analytic everywhere except for discrete poles at the 
negative real bound-state energies EL, and a cut parallel to and just below 
the positive real axis. We define 


T(r',r,E) = fh Gt (rn) V(r) V(r) exp [ik (r — r’)] (39.47) 


IN. N,. Khuri, Phys. Rev. 107, 1148 (1957); A. Klein and C. Zemach, Ann. Phys. 
(N.Y.) 7, 440 (1959). The present treatment follows most closely that of M. L. 
Goldberger and K. M. Watson, ‘‘Collision Theory,’ sec. 10.4 (Wiley, New York, 
1964). 
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Fig. 37. Contour C in Kq. (39.49). 


such that 
T(E) = Te + SfT(e’,r,E) dr’ dr (39.48) 


The analytic properties of 7(r’,r,E) are the same as those of G+, so that 

a dispersion relation can be obtained for it by starting with Cauchy’s 

theorem, as in (39.21). We thus consider the integral 

1 T (r’,t,E’) 
loa 


Rae 


Qri 
where the contour C, shown in Fig. 37, does not cross the cut. This 
integral is equal to a sum of residues at the bound-state poles, which are 
denoted schematically by crosses in the figure, together with a residue 
from the pole at E: 


Ap POCEE) apy — V Ea' 
Silo ere = Ns 


n 


+ T(r’,1,E) (39.49) 


Here I,,(r’,r) is the residue of T(r’,r,E) at the pole E = E,; that is, 
T(r’,r,E) behaves like T',(r’,r)/(Z — E,) in the neighborhood of E = E,. 


DISPERSION RELATION FOR 7(£) 


The contour consists of three parts: the infinitesimal circle about the 
origin, the straight lines just above and just below the positive real axis 
on the two sides of the cut, and the large circle to which we assign the 
radius Ey. With regard to the infinitesimal circle, we note from (39.46) 
that the bound-state contributions to G+ are finite as E — 0 since these 
states are assumed to have finite negative energies. Also, the continuum 
contribution to G,*+ is finite since the k? in E, in the denominator is 
canceled bv the k? in the differential d*k. Thus, with the help of (39.47), 
we see that 7(r’,r,E) remains finite as E’ — 0 in the cut plane, and there 
is no contribution from the infinitesimal circle. 
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The contribution to the left side of (39.49) from the two sides of 
the positive real axis is 


l Eo T (r’,r,E’t) — T(r’ ,1,E'-) 
Sil ee 


where E’+ are just above and below the cut. It is convenient to replace 
this integral with 


J pe Men) ~ Mew) 
2n2t JO E' —E 


which yields the same result when integrated over r and r’, as is done in 
(39.48). We now wish to show that T*(r’,r,E+) = T(r,r',E-). We are 
interested in situations that possess time reversal invariance, so that V 
is real; further, {exp [ckt++ (r ~— r’)]}* = exp [ck-- (r’ — r)], so that in 
accordance with (39.47) we need only show that G,.+*(r',r) = G,-t+(r,r’). 
Equations (36.22) and (37.8) show that G,.+*(r’,r) = G.--(r,r’) when time 
reversal obtains; however, since we are off the real axis, the limit « — 0 
can be taken and the distinction between G+ and G- disappears. Thus 
we have established that the second term in the numerator of the inte- 
grand is the complex conjugate of the first, so that the contribution to 
the left side of (39.49) from the real-axis integrations is 


dk’ 


0 : ay 
J if Im POE) ap (39.50) 


t E’—E 


va 


Finally, we wish to show that the contribution from the large circle 
vanishes in the limit Ey)» ©. The careful analysis required is given in 
the references cited above, and only an indication is mentioned here. 
The main assumption we need is the plausible one that G+ approaches 
the free Green’s function Go,+ as EH — ©, where from (37.19) 


Got (r’ xr) = — r’ — r|-! exp (zk|r’ — r]) 


Hl 
2rh 

It is evidently important that k approach infinity with a positive imagi- 
nary part. That this is actually what happens follows from the earlier 
identification of bound states with poles of the S matrix element or 
scattering amplitude on the positive imaginary k axis and from the pres- 
ent identification of bound states with poles on the negative real E axis. 
Thus the full Riemann sheet of the cut E plane with which we are working 
is to be mapped onto the upper half of the k plane, in accordance with 
E = h?k?/2y.1 It can then be shown that the exponential in the Green’s 
function dominates the plane waves in (39.47), so that ff{[T(r’,r,E) d*r’ d*r 


1 This is called the physical sheet of E; the unphysical sheet maps onto the lower half 
of the k plane. 
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vanishes as E> o. A similar argument shows that the integral (39.50) 
converges as the upper limit approaches infinity. 

We now substitute (39.50) with Ey = © on the left side of (39.49) 
and integrate over r’ andr. With the help of (39.48), we obtain 


, 

where we have replaced the integration variable E’+ = E’ + ie by E’ in 
the last integral and put Tl, = fJT.(r’,r) d*r’ d’r. Equation (39.51) is 
the dispersion relation for the forward scattering amplitude. We recall 
that T(E) = —(2h?/y)f(k,k) from (39.44), that Tz = [V(r) d*r from 
(39.45), and that Im T(E) = —(h?k/2u)ot%.(E) from (20.13). The only 
quantity that remains to be evaluated is I, which is the residue of T(E) 
at the bound-state pole E = E,. 

As a simple example, we note that, if there is a bound state that is 
described by effective range theory, the S matrix element (39.43) is given 
approximately by! 


2ix(k + a@) 
(x — a)(k — ix) 


near the pole at k = ix. Then, near the corresponding pole at k? = ~—x?, 
T(E) is dominated by (—2rh?/2y)(1/27k) times this S matrix element, so 
that 

: _ _ dah? K(k + @) 

ME) = em a) + 


We thus obtain 


(39.52) 


where the effective range ro can be determined experimentally from 
(39.39). 


SUBTRACTED DISPERSION RELATION 


The dispersion relation (89.51) relates a number of measurable quantities 
and hence can, at least in principle, be put to experimental test. Thus 
|7(£)|? is proportional to the differential cross section in the forward 
direction at energy E, |T's|? is the same quantity at infinite energy, 
Im T(E) is proportional to the total cross section, and in the effective- 
range approximation I can be found from (39.52). 


1 It can be shown that the pole of (39.43) at k = ia, which appears in the 1 = 0 partial 
wave but does not correspond to a bound state, is not present in the full amplitude 
T(E). 
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It may happen,. however, that the high-energy region is not very 
accessible or that the integral in (39.51) does not converge rapidly enough 
to be evaluated with available experimental data. In this case a sub- 
tracted dispersion relation can be obtained from (39.51), by subtracting 
from 7'(£) the similar expression for T(E»), where Ey is an arbitrarily 
chosen energy. The result is 


lr, 
2 (E a E,) (Eo fee En) 


5 Im T(E’) 
aJo (E’ — BE — te)(F! — Ey — te) 


T(E) = T(E) + (Ev ~ B) 
az’ (39.53) 


It is apparent that this becomes (39.51) when Ey = ©. An advantage 
of Eq. (39.53) is that the integral converges more rapidly than that in 
(39.51), since its integrand has a higher power of E’ in the denominator; 
on the other hand, it cannot be used unless T(E») is known. 


PROBLEMS 


1. Derive the four equations (36.8) by making use of the arbitrariness of y at any 

instant of time. 

2. Make use of the expression (36.9) for Gy to show explicitly that the first of Eqs. 

(36.8) is valid for Got. 

3. Show that the inner product of any two solutions of the time-dependent Schréd- 

inger equation is constant in time if the hamiltonian is hermitian. Use this result 

first to show that the orthogonality of any two solutions is preserved in time and, 

second, to obtain an expression for (8|S|«) in the form of an inner product that can be 

evaluated at any time (not just at t’ > T2 as in (36.32) or at t < —T, as in (36.38)}. 

4. Show what the connection is between Eq. (35.9) and the second term on the right 

side of Eq. (36.35). 

5. Evaluate the right side of Eq. (37.7) with +7e replaced by —ze, and show that 

it is equal to 6(r) — 1. 

6. Derive Eq. (37.16) by symbolic manipulation of (37.14). Then derive Eq. (37.17) 

without using symbolic relations. 

7. Evaluate Gow (r’,r), defined by Eq. (37.8) with +7e replaced by —ie, and obtain 

it in a form similar to (37.19). Also show that the substitutions (37.22) are justified. 

8. Make use of the unitarity of the S matrix, as given in Eq. (37.31), to derive the 

generalized optical theorem and then the optical theorem as a special case. 

9. Show that the total scattering cross section by a real potential that falls off at 

great distances like r—* is finite if and only if n > 2, first by means of the Born approxi- 

mation amplitude (38.2) and, second, by means of the Born approximation phase 

shifts (38.14). 

10. Find the differential scattering cross section for a real potential V(r) = —Voe7"/s, 

using the Born approximation. What is the validity criterion in this case, and under 

what circumstances is it satisfied? 

11. Use the Born approximation to discuss qualitatively the scattering by a crystal 
lattice of identical atoms. 
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12. Use the optical theorem and the Born approximation amplitude (38.1) to calcu- 
late the total cross section for a real potential. Discuss your result. Repeat for a 
complex potential. ns oe 

13. A proton is scattered from an atom that can be represented by a screened coulomb 
potential —(Ze?/r)e-"*, together with a real attractive short-range square well 
potential of depth — Vo and radius F that arises from the nucleus. Use the distorted 
wave Born approximation to calculate the scattering amplitude. Assume that only 
the | = 0 partial wave is affected by the nuclear potential; treat this exactly, and 
treat the coulomb field to first order. 


14. Use the perturbation theory to calculate the differential collision cross section 
for the 1S — 2P excitation of a hydrogen atom. Show that the total cross section 
becomes the expression given below Eq. (38.29) at high bombarding energy. 


15. Use the eikonal approximation to calculate the total cross section for scattering 
by the potential V(r) = —Vo(1.+ zt) for r <a, V(r) = 0 for r > a, where Vo and 
€ are positive. Find a situation in which you believe that your result should agree 
with that obtained in Prob. 11, Chap. 5, and then check to see if there is indeed 
agreement. 

16. Calculate the Jost function for 1 = 0, fo(k), for a real attractive square well 
potential of depth —V» and radius a. Use it to give as full a discussion as you can of 
any bound states in this potential and of the scattering phase shift. 


10 
Identical Particles and Spin 


The quantum-mechanical theory of particles presented thus far is defi- 
cient in three respects. First, whenever two or more particles are 
described at once, like the electron and proton of the hydrogen atom 
(Sec. 16) or the incident and atomic electrons in an inelastic collision 
(Sec. 38), it is assumed that the particles can be distinguished from each 
other. This is a valid assumption in the first example, since electrons 
and protons possess quite different masses and electrical charges. In 
the second example, however, there is no observable difference between 
the incident and atomic electrons, and the consequences of this identity 
should appear in the formalism. The second defect of the theory is that 
spin has not yet been incorporated into the description of the motion.! 
Third, no mention has been made of the special theory of relativity, which 
is expected to affect the theoretical description of particles that move 
with speeds close to that of light. 


1 See Sec. 27. Spin was first discovered in connection with electrons by G. E. Uhlen- 
beck and 8. Goudsmit, Naturwiss. 18, 953 (1925); Nature 117, 264 (1926). 


362 


IDENTICAL PARTICLES AND SPIN 363 


The ways in which the first two of these defects can be remedied 
are described in this chapter, and illustrative examples are discussed. 
Relativistic effects are taken up in Chap. 13. 


40LIIDENTICAL PARTICLES 


Identical particles cannot be distinguished by means of any inherent 
property, since otherwise they would not be identical in all respects. 
In classical mechanics, the existence of sharply definable trajectories for 
individual particles makes it possible in principle to distinguish between 
particles that are identical except for their paths, since each particle can 
be followed during the course of an experiment. In quantum mechanics, 
the finite size and the spreading of the wave packets that can describe 
individual particles often make it impossible to distinguish between 
identical particles because of their positions, especially if they interact 
with each other to an appreciable extent. This is true of the electrons 
in a single atom, where we have seen that the description in terms of 
moving wave packets breaks down completely. However, the electrons 
of different atoms that are well separated from each other may, to good 
approximation, be regarded as distinguishable. This section considers 
some of the effects of identity on the quantum-mechanical treatment of 
systems of two or more particles. Other effects that involve the spin 
explicitly will be taken up in the remainder of this chapter. 


PHYSICAL MEANING OF IDENTITY 

The impossibility, in principle, of distinguishing between identical parti- 
cles in most quantum-mechanical problems can lead to effects that have 
no classical analog. As an example, we compare the elastic collision of 
two identical particles that have a particular interaction between them 
with the collision of two different particles that have the same interaction 
between them. 

In a classical treatment, there is no difference of principle between 
the results of these two experiments, since it is possible to distinguish 
between the incident and struck particles in the first case as well as in 
the second. In practice, however, this distinction would usually be made 
only in the second experiment. Thus, according to classical mechanics, 
the measured differential cross section in the first experiment is equal to 
the sum of the corresponding cross sections measured for the incident 
and struck particles in the second experiment. In the corresponding 
quantum-mechanical situation, the identical particles in the first experi- 
ment cannot be distinguished by means of their trajectories, since they 
cannot be well localized without interfering with the scattering process. 
Thus the distinction between incident and struck particles has no physical 
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significance, and the simple connection between the results of the two 
experiments that is found in the classical case need not exist. 

We use the word identical to describe particles that can be sub- 
stituted for each other under the most general possible circumstances 
with no change in the physical situation. Identical particles can in some 
cases be distinguished from each other, as when their wave packets do 
not overlap. Another case, discussed more fully in Sec. 41, arises when 
each of the particles possesses an intrinsic spin angular momentum, 
which is a constant of the motion in a particular collision. Then, since 
the component of the spin along some axis is assumed not to change 
during this collision, the particles can be distinguished if they have 
different spin components. Results of this kind must, of course, be a 
consequence of the formalism that we now set up. 


SYMMETRIC AND ANTISYMMETRIC WAVE FUNCTIONS 


The Schrédinger wave equation for n identical particles is 
ih y (1,2, oad, SE) = AGS a a OWT 2 ee GD) (40.1) 


where each of the numbers represents all the coordinates (positional and 
spin) of one of the particles. The hamiltonian H is symmetrical in its 
arguments, since the identity of the particles means that they can be 
substituted for each other without changing H or, indeed, any other 
observable. 

There are two kinds of solutions ¥ of Eq. (40.1) that possess sym- 
metry properties of particular interest. A wave function is symmetric if 
the interchange of any pair of particles among its arguments leaves the 
wave function unchanged. A wave function is antisymmetric if the inter- 
change of any pair of particles changes the sign of y. We now show that 
the symmetry character of a wave function does not change in time. If 
Ws is symmetric at a particular time ¢, then Hys is also symmetric, and 
(40.1) states that dys/dt is symmetric. Since ys and its time derivative 
are symmetric at time t, ys at an infinitesimally later time ¢ + dt is given 
by vs + (d~s/dt) dt and is also symmetric. Such a step-by-step integra- 
tion of the wave equation can, in principle, be continued for arbitrarily 
large time intervals, and ys is seen to remain symmetric always. In 
similar fashion, if ya is antisymmetric at any time, Hwa and hence 
dy,/dt are antisymmetric, and integration of the wave equation shows 
that ya is always antisymmetric. 

The foregoing proof is not altered if H andy have as their arguments 
the coordinates of two or more different groups of identical particles; 
thus a wave function that is initially set up to be symmetric or anti- 
symmetric in the coordinates of each identical particle group always 
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retains this character. This makes it possible for the different groups of 
identical particles found in nature to have definite symmetry characters, 
and this is actually found to be the case. Although symmetry char- 
acters other than these two would also be preserved in time by a sym- 
metric hamiltonian, they do not appear to correspond to particles found 
in nature.! 


CONSTRUCTION FROM UNSYMMETRIZED FUNCTIONS 


We now show how ys or Wa can be constructed from a general unsym- 
metrized solution y of Eq. (40.1). If the arguments of y are permuted 
in any way, the resulting function is a solution of (40.1). That this is 
true follows from the observation that the same permutation applied 
throughout Eq. (40.1) does not impair its validity, since it corresponds 
simply to a relabeling of the particles; then, since H is symmetric, the 
permuted H is the same as the original H, and the resulting equation is 
the same as (40.1) for the permuted y. In this way n! solutions can be 
obtained from any one solution, each of which corresponds to one of the 
n! permutations of the n arguments of y. It is evident that any linear 
combination of these functions is also a solution of the wave equation 
(40.1). 

The sum of all these functions is a symmetric (unnormalized) wave 
function ys, since the interchange of any pair of particles changes any 
one of the component functions into another of them and the latter into 
the former, leaving the entire wave function unchanged. An antisym- 
metric unnormalized wave function can be constructed by adding together 
all the permuted functions that arise from the original solution by means 
of an even number of interchanges of pairs of particles, and subtracting 
the sum of all the permuted functions that arise by means of an odd num- 
ber of interchanges of pairs of particles in the original solution. It is 
apparent that a nonvanishing antisymmetric wave function cannot be 
formed from a solution that is unaltered by the interchange of any pair 
of particles. 

In the event that the hamiltonian does not involve the time, sta- 


tionary solutions (1,2, . . . ,n;t) = u(1,2, . . . ,n)e~*¥#/* can be found, 
where 
[H(1,2... ,n) — Eju(1,2,... n) = 0 


The earlier discussion shows that the solutions derived from any u by 
means of permutations of its arguments are degenerate with the original 
u; this is called exchange degeneracy. 


1 For fuller discussion of this point, see O. W. Greenberg and A. M. L. Messiah, Phys. 
Rev. 188, B1155 (1965); M. D. Girardeau, Phys. Rev. 189, B500 (1965). 
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THE SYMMETRIC GROUP! 


The symmetry of the hamiltonian with respect to permutations of its 
arguments implies the existence of a group of transformations on the 
state functions, in accordance with the general discussion of Chap. 7. 
We again adopt the Schrédinger picture and write in analogy with Eq. 
(26.1) 


Ya [P(1,2, .. . ,n)3t] = wa(1,2 . . . ,n;t) (40.2) 


where P is some permutation of the m arguments. Then the analog of 
Eqs. (26.2) are 


U,(P)|e) = |e’) or U,(P)va(1,2, . . . ,n5t) 
= Wa(1,2,..°. nit) (40.3) 


where U,(P) is the unitary permutation operator. Combination of Eqs. 
(40.2) and (40.3) leads to the analog of (26.3): 


U,(P)Wall,2, .. . nit) = palP*(1,2, . . . sm) if] (40.4) 


where P-! is the inverse permutation to P, such that PP~1 = P-'!P = 1, 
the identity permutation. It also follows, in analogy with Eq. (26.5), 
that the symmetry of the hamiltonian implies that [U,(P),H] = 0 and 
hence that a permuted solution represents a possible motion of the 
system. 

The n! permutations P of n objects evidently form a discrete group 
with n! elements, which is called the symmetric group. The n! operators 
U,(P) are isomorphic to the permutations. 

When n = 2, there are only two permutations, the identity and the 
interchange, which we call 1 and P, respectively. If P is applied twice 
in succession to two objects, we obtain the identity. Thus the sym- 
metric group of two objects is isomorphic to the space-inversion group 
discussed in Sec. 29. When use is made of Eq. (40.4), it is apparent 
that the analog of Eqs. (29.3) are (on omitting the time dependence) 


U,(P)u(1,2) = wu(2,1) U,>(P)u(1,2) = wu(1,2) 


Again, U,?(P) is expected to bring a state into itself, so that w? is a number 
of unit magnitude. Unlike the situation with inversion, there is now no 
reason to assume that w? is anything other than unity. Thus» = +1, 
and these characterize the symmetric and antisymmetric states, respec- 
tively. We thus obtain two (unit rank) matrix representations of the 


1. P. Wigner, ‘Group Theory,” chap. 13 (Academic, New York, 1959); A. Messiah, 
“Quantum Mechanics,” App. D IV (North Holland Publishing Company, Amster- 
dam, 1962). 
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symmetric group when n = 2: 

U,(1) =1 U,(P)=1 and U,(1) =1 U,(P) = —1 (40.5) 
The corresponding eientunetiote of these operators are 

u(1,2) + u(2,1) (40.6) 


It can be shown for any value of n that the only two matrix repre- 
sentations U,(P) that are of unit rank are the analogs of (40.5): the 
symmetric representation U,(P) = 1 for all P and the antisymmetric 
representation U,(P) = +1 according as the number of interchanges 
that make up the permutation P is even or odd. For example, with 
n = 3, the symmetric and antisymmetric eigenfunctions are 


[u(1,2,3) + u(2,3,1) + u(3,1,2)] + [u(2,1,3) + u(1,3,2) + u(3,2,1)] 
(40.7) 


It can also be shown that there is only one other independent representa- 
tion of the U,(P) when n = 3; this consists of six 2 X 2 matrices and 
hence operates on pairs of eigenfunctions (see Prob. 1). 


DISTINGUISHABILITY OF IDENTICAL PARTICLES 


It is to be expected that the result of an experiment is independent of 
the symmetry character of the wave function if the coordinates of the 
particles do not overlap. This corresponds to a situation in which the 
particles can be distinguished by means of their positions (or their spin 
components) even though they are identical. Such a situation implies, 
in the case of two particles, that the wave function u(1,2) is different 
from zero only when the coordinate 1 is in some region A, the coordinate 
2 is in a region B, and A and B have no common domain. 

The coordinate probability density associated with the wave func- 
tion u(1,2) is |w(1,2)|?, and the densities associated with the symmetrized 
wave functions (40.6) are 


ju(1,2) + u(2,1)|? = |u(1,2)|? + |u(2,1)|? + 2 Re [u(1,2)u*(2,1)] 
; (40.8) 


where Re denotes the real part of the expression in brackets. If now 
u(1,2) vanishes whenever 1 is not in A and 2 is not in B, and A and B do 
not overlap, the bracket term is zero everywhere, and (40.8) becomes 
|u(1,2)|? + |u(2,1)|?. 

Thus the density associated with either of the symmetrized wave 
functions (40.6) is the sum of the densities associated with u(1,2) and 
u(2,1) separately. This is precisely the result that will be obtained if 
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the particles are not identical but no attempt is made to distinguish 
between them in performing the experiment. Thus the interference 
effects between exchange-degenerate wave functions, represented by the 
bracket term in (40.8), disappear when the coordinates of the particles 
do not overlap. 


THE EXCLUSION PRINCIPLE 


In many problems, a useful zero-order approximation can be obtained 
by neglecting the interactions between the particles that make up the 
system under consideration. The approximate (unperturbed) hamilto- 
nian is the sum of equal hamiltonian functions for the separate particles 


H,(1,2,... ,n) = Ho(1) + W4(2) + . . . + Aon) (40.9) 


and the approximate energy eigenfunction is a product of one-particle 
eigenfunctions of Hj: ; 


u(1,2, .. . »m) = va(1)ue(2) > + * v,(n) 
KE=EH,+Het+::: +H#, (40.10) 
Hy(1)va(1) = Eava(1) etc. 


If the particles are electrons, an antisymmetric wave function must 
be constructed from the u given by (40.10). This is most easily expressed 
as a determinant of the v’s: 


Vq(1) Va(2) oases Va(n) 

ve(1) vg(2) ie vg (7) 
ua(lj2,...n)=fo0 ce tet tee (40.11) 

v,(1) v,(2) aa v,(n) 


The (unnormalized) us given in (40.11) is clearly an antisymmetric 
solution of the approximate wave equation (Hy — E)ua = 0. 

Equation (40.11) has the interesting property that it vanishes if two 
or more of the v’s are the same. This is a special case of the general 
result stated earlier that an antisymmetric wave function cannot be 
constructed from a solution that is unaltered by the interchange of any 
pair of particles. Thus the approximate hamiltonian H» has no solu- 
tions for which there is more than one electron in any one of the states 
a, B, ...,». This result is known as the exclusion principle and was 
first postulated by Pauli! as an explanation of the periodic system of the 
chemical elements. 


1 W. Pauli, Z. Physik 81, 765 (1925). 
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CONNECTION WITH STATISTICAL MECHANICS 


The unsymmetrized zero-order solution given in Eq. (40.10) can be used 
to construct a symmetric as well as an antisymmetric wave function. 
Such a symmetric (unnormalized) function is easily seen to be the sum 
of all different permutations of the numbers 1, 2, . . . , m among the one- 
particle eigenfunctions »,, vg, . . ., v,. This wave function is unique 
and can be specified simply by stating how many particles are in each of 
the states a, 8, .... In the same way, an antisymmetric wave func- 
tion can be specified by stating how many particles are in each state. 
The fundamental statistical difference between particles that are described 
by antisymmetric and by symmetric wave functions is that the number 
of the former type that can occupy any state is limited to 0 or 1, whereas 
any number (0,1,2, . . .) of the latter type of particles can occupy any 
state. , 

The treatment of aggregates of large numbers of noninteracting (or 
weakly interacting) particles for which the states can be enumerated in 
these two ways forms the subject matter of quantum statistical mechanics. 
Particles that are described by antisymmetric wave functions are said to 
obey Fermi-Dirac statistics and are called fermions, and particles that are 
described by symmetric wave functions. obey Einstein~Bose statistics and 
are called bosons.! 

Of the material particles whose statistics are definitely known, 
electrons, protons, and neutrons are fermions, and 7 mesons are bosons.? 
Also, light quanta, or photons, insofar as they can be treated as particles, 
obey Einstein-Bose statistics. Further, aggregates of particles that are 
sufficiently tightly bound so that they can be regarded as “‘particles’”’ 
are described either by symmetric or by antisymmetric wave functions. 

For example, the nucleus of a helium atom is made up of two pro- 
tons, two neutrons, and an indeterminate number of x mesons, which are 
strongly bound together. If we consider a number of helium nuclei that 
interact with each other weakly enough so that the changes in the internal 
motions of the nuclei can be neglected, we can see that the motions of the 
centers of gravity of the nuclei can be described approximately by a sym- 
metric wave function. The interchange of a pair of helium nuclei can 
be thought of as the resultant of the interchanges of two pairs of protons, 
two pairs of neutrons, and a number of pairs of x mesons. Since the 
actual wave function is antisymmetric in all the protons and in all the 
neutrons, the resultant of the first four interchanges leaves the approxi- 
mate wave function unchanged; the symmetry of the wave function in the 
mw mesons is such that the latter interchanges also have no effect. By an 
1 See, for example, K. Huang, ‘Statistical Mechanics,” chap. 9 (Wiley, New York, 
1963). 

2 See, for example, the references cited on page 205. 
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extension of this argument, we see that strongly bound “particles” that 
interact weakly with each other (nuclei, atoms, or molecules) obey Ein- 
stein-Bose statistics when each of them consists of an even total number 
of electrons; protons, and neutrons, and they obey Fermi-Dirac statistics 
when each consists of an odd total number of these particles.} 


COLLISIONS OF IDENTICAL PARTICLES 


When the only forces acting on two particles result from their mutual 
interaction, the overall motion can be separated into motion of the center 
of mass of the two particles and motion of the particles relative to each 
other, as discussed in Secs. 16 and 18. It is apparent that an interchange 
of two identical particles does not affect the position vector of the center 
of mass [which is $(r1 + rz) since the particles have equal masses] but 
changes the sign of the relative position vector r (= t1 — 12). We post- 
pone consideration of the spins of the particles until the next section and 
see now what effect symmetry or antisymmetry of the space part of the 
wave function has on the elastic scattering of a particle by another that is 
identical with it. 

The asymptotic form of the unsymmetrized scattering wave function 
in the center-of-mass coordinate system is given by Eq. (18.10). 


u(r) Pa ett + r—lf(0,p)e*" (40.12) 


where r, 6, ¢ are the polar coordinates of the relative position vector r. 
Since the polar coordinates of the vector —r are r, 7 — 6, 6 +7, the 
asymptotic forms of the symmetric and antisymmetric wave functions 
formed from (40.12) are given by 


(e# + eH) + [f(0,6) + fr — 6, + m)]rte* (40.13) 


with upper and lower signs, respectively. 

From the discussion of Sec. 18, it follows that the differential scatter- 
ing cross section in the center-of-mass coordinate system is the square of 
the magnitude of the bracket term in (40.13): 


o(9,6) = |f(8,¢)|? + |f@ — 6, 6 + 7)|? 
+ 2 Re [f(0,6)f*@ — 6,¢+7)] (40.14) 
The normalization adopted here can be justified by noticing that in the 
classical limit, where the identical particles are distinguishable and the 


last (interference) term in Eq. (40.14) drops out, o(8,¢) becomes just 
the sum of the differential cross sections for observation of the incident 


1A more rigorous treatment that leads to the same conclusion has been given by 
P. Ehrenfest and J. R. Oppenheimer, Phys. Rev. 37, 333 (1931). 


IDENTICAL PARTICLES AND SPIN 371 


particle (|f(0,¢)|*) and of the struck particle (|f(r — 6, @ + 7)|?), as it 
should. 

In the usual case, for which f is independent of ¢, it is apparent that 
the scattering per unit solid angle is symmetrical about 6 = 90° in the 
center-of-mass coordinate system. It is easily seen from Eq. (18.7) with 
y = 1 that the scattering per unit angle (not per unit solid angle) in the 
laboratory coordinate system, 


o0(80) sin 6) = 4 cos Oo Sin Oo {|f(200) |? 
+ [f(r — 260)|? + 2 Re [f(260) f*( — 26)]} 


is symmetrical about 0) = 45°. 


410SPIN ANGULAR MOMENTUM 


The treatment of identical particles presented in the preceding section 
must now be supplemented by inclusion of the spin angular momenta of 
the particles. It was shown in Sec. 27 that the spin angular momentum 
S of a particle can be defined so that S? commutes with all dynamical 
variables. Then S? is strictly a constant of the motion and can be 
replaced by s(s + 1)h?, where s is an integer or half an odd integer. 


CONNECTION BETWEEN SPIN AND STATISTICS 


As remarked in Sec. 27, electrons, protons, neutrons, neutrinos, and 
muons have s = 4, and pions have s = 0. Aggregates of particles that 
are sufficiently tightly bound can be regarded as “particles” and can be 
characterized by definite magnitudes of their total internal angular 
momenta, so long as their internal motions and the relative spin orienta- 
tions of their component particles are not significantly affected by the 
interactions between aggregates. This is exactly analogous to the situa- 
tion with regard to the statistics obeyed by the aggregates, discussed in 
the preceding section. 

The treatment of the addition of angular momenta presented in 
Sec. 28 shows how to calculate the possible magnitudes of the total 
internal angular momentum of any aggregate of fundamental particles; 
we Call this the spin of the aggregate. If it consists of n particles, each of 
which has s = 4, and any number of particles with s = 0, and if the 
internal orbital angular momentum of these particles is ignored, the total 
s can be any integer from 0 to 3n if n is even or can vary by integer steps 
from ¢ to gn if nis odd. The total orbital angular momentum quantum 
number can be shown to be an integer or zero in general; its inclusion 
1 The work of Secs. 27 and 28 shows that this is true for noninteracting particles 


that move in central force fields, and the result turns out not to be affected by particle 
interactions. 
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extends the maximum value of s for the aggregate but does not alter the 
conclusion that s is zero or an integer if n is even and is half an odd integer 
if n is odd. 

We see then that, for the known fundamental particles and for 
aggregates of them that have a definite spin, there is a unique connection 
between the spin and the statistics. Particles or aggregates that have 
zero or integer spin are described by symmetric wave functions and obey 
Einstein-Bose statistics, and particles or aggregates that have half-odd- 
integer spin are described by antisymmetric wave functions and obey 
Fermi-Dirac statistics. There is good theoretical reason, based on 
relativistic quantum mechanics,’ to believe that this connection also 
holds for other fundamental particles whose spin and statistics have not 
as yet both been determined (other mesons and neutrinos). 


SPIN MATRICES AND EIGENFUNCTIONS 


The spin can be included in the formalism developed in Sec. 40 by having 
each of the numbers 1, 2, . . . , 1 that appear as the arguments of y and 
u represent a spin coordinate as well as the three space coordinates of that 
particle. The spin coordinate differs from the space coordinates in that 
it takes on only 2s + 1 values for a particle (or aggregate) of spin s, 
instead of the infinite number of values that are taken on by each space 
coordinate. The spin wave function of a single particle is completely 
determined by the specification of 2s + 1 numbers, whereas the space 
wave function involves the specification of a continuously infinite set of 
numbers (which is equivalent to a continuous function of the space 
coordinates) .” 

A convenient set of orthonormal one-particle spin functions is pro- 
vided by the normalized eigenfunctions of the J? and J, matrices given 
in Eqs. (27.26). These eigenfunctions are (2s + 1)-row, one-column 
matrices that have zeros in all positions except one. For example, if 
s = 3, the four spin eigenfunctions are easily seen to be 


1 0 0 0 

0 1 0 0 
WH =(5) @=lo} %~-P=|q} P= 
0 0 0 1 

(41.1) 


1R. F. Streater and A. S. Wightman, “PCT, Spin and Statistics, and All That,” 
sec. 4-4 (Benjamin, New York, 1964). 


2 If the space and spin motions are closely enough coupled together, the space wave 


function may depend on the spin coordinate, so that 2s + 1 space functions are 
required. 
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and correspond to S, eigenvalues of 3, $h, —gh, and —$h, respectively. 
The orthonormality is demonstrated by multiplying the hermitian adjoint 
of one spin function into itself or another function 


{0 100]] 0 [0100]; 0 
-|l 0 

5 =] 1 = 0 etc. 
0 0 


with the help of the usual rule for matrix multiplication. 

Symmetric or antisymmetric many-particle wave functions can be 
constructed from unsymmetrized solutions that include the spin by fol- 
lowing the procedure outlined in the preceding section. It is sometimes 
convenient to choose the unsymmetrized solutions to be eigenfunctions 
of the square of the magnitude of the total spin of the identical particles 
(Si+S.+ ---+S8,)? and of the z component of this total spin 
Siz + So. +--+ + Snrz These quantities are constants of the motion 
if the hamiltonian does not contain interaction terms between the spins 
and other angular momenta. In addition, such functions are often useful 
as zero-order wave functions when the spin interactions are weak enough 
to be regarded as a perturbation. There is no loss of generality in choos- 
ing the unsymmetrized solutions in this way, since any solution can be 
expressed as a linear combination of total-spin eigenfunctions. 


COLLISIONS OF IDENTICAL PARTICLES 


The effect of spin on the collision of two identical particles (or aggregates) 
can now be taken into account if the interaction between the particles 
does not involve the spin. Since each particle has 2s + 1 spin eigen- 
functions, there are altogether (2s + 1)? independent spin functions for 
the pair, each of which is a product of one-particle spin functions. 

Any (2s + 1)? linearly independent combinations of these products 
can be used in place of them. These are conveniently divided into three 
classes. The first class consists of products of one-particle functions in 
which both particles are in the same spin state with S, value mh: 


v1(m)v2(m) —s<mcss 
where the subscript specifies which of the particles is in each state; there 
are evidently 2s + 1 such states. The second class consists of sums of 
products 


01(m’)v2(m"’) + o1(m"")v2(m’) m! = m" 


There are s(2s + 1) of these states. The third class consists of differ- 
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ences of products 
v1(m’)v2(m"’) — v1(m’’)v2(m’) m’ ~ m’’ 


Again there are s(2s + 1) of these. 

The first two classes are clearly symmetric in an interchange of the 
spin coordinates of the two particles, and the third class is antisymmetric 
in such an interchange. Thus the total of (2s + 1)? states can be divided 
into (s + 1)(2s + 1) symmetric and s(2s+ 1) antisymmetric states. 
Associated with the symmetric spin states must be a symmetric space 
state if s is an integer (symmetric total wave function), and an anti- 
symmetric space state if s is half an odd integer (antisymmetric total 
wave function). Similarly, the antisymmetric spin states multiply 
antisymmetric space states if 2s is even and multiply symmetric states 
if 2sisodd. We cansee then that, if all the spin states are equally likely to 
appear in a collision,! a fraction (s + 1)/(2s + 1) of the collisions will be 
described by the wave function (40.13) with the upper sign, and a fraction 
s/(2s + 1) will be described by (40.13) with the lower sign, if 2s is even. 

This and the similar result for 2s odd can be summarized by rewriting 
Eq. (40.14): 


(=)* . 
2841 2 Re [f(O)f*(r — 6)] (41.2) 


(8) = [f(@)|* + Ife — )| + 


where f is assumed to be independent of ¢. 

Equation (41.2) can also be derived by making use of the earlier 
observation that particles that have different spin components are dis- 
tinguishable, in which case the interference term in (40.14) disappears. 
This occurs in a fraction 2s/(2s + 1) of the collisions. In the remaining 
_ fraction 1/(2s + 1) of the collisions, the particles have the same spin 
component, and the symmetric or antisymmetric space state (upper or 
lower sign in the interference term) must be used according as 2s is even 
or odd. 


ELECTRON SPIN FUNCTIONS 


In the remainder of this section we consider only electron spin 
functions (s = 4). The spin matrices are given by the first line of Eq. 
(27.26) and may be written as S = 4hé, where 


os = b | oy = e =a *. = E a (41.3) 


are the Pauli spin matrices (27.30). The normalized eigenfunctions of S, 


1 This is the fundamental postulate of quantum statistical mechanics; see, for example, 
Huang, loc. cit. 
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may be written in analogy with Eqs. (41.1) as 
1 0 
w=[5]  «--(9] (41.4 


and have eigenvalues $f and —3h, respectively; they are both eigen- 
functions of S? with the same eigenvalue #h’. 

Since we shall have occasion to write the products of spin functions 
for different electrons, it is convenient to abbreviate the notation as 
follows: 


v1($)ve(—4)v3(4) a(S) = Ce) ete. 


where the first particle has the eigenvalue 3f for S:., the second has the 
eigenvalue —h for Se, etc. S,; has no effect on the spin functions of 
any but the first particle. 

The following formulas are easily obtained from (41.3) and (41.4): 

O,0y = —O,0, = Ws etc. 

o-(+) = (—) oy(+) = u(—) o(+) = (Ey 

o2(—) = (+) o(—) = —1(+) a(—) = —(—) (41.5) 

There are four linearly independent spin functions for a pair of elec- 
trons: (++), (+—), (—+), (——). These are orthonormal, since the 
one-particle spin functions (41.4) are orthonormal. As remarked earlier, 
it is often convenient to regroup these functions into combinations that 
are eigenfunctions of (Si + S2)? and S;, + Se. These combinations are 
easily found by using Eq. (28.1) with the Clebsch-Gordan coefficients 


(28.12); the orthonormality and the indicated eigenvalues can also be 
verified with the help of (41.5): 


. (S: + S.)? : Su + So 
(++) oh h 
23+ —) + (=F )] 2h? 0 (41.6) 
(—~) Qh? —h 
2-(+—-) — (—+)] 0 0 


As expected from Sec. 28, the first three of the two-particle spin 
functions (41.6) together behave in all respects like a single ‘“‘particle’’ of 
spin s = 1, and the last of the spin functions (41.6) behaves like a single 
“particle” of spin s = 0.1 Not only do they have the proper eigenvalues 
of the square of the magnitude of the total spin and the z component of 
the total spin, but the result of operating on the triplet spin function with 
the z or y components of the total spin is in agreement with the corre- 
sponding matrices in the second line of Eq. (27.26). 

1 The first three states are called a triplet and the last a singlet. In the old quantum 


theory, the triplet corresponds to parallel electron spins and the singlet to antiparallel 
spins. 
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THE HELIUM ATOM 


The ground state of the helium atom was considered from the point of 
view of the variation method in Sec. 32. We now consider the ground 
and first excited states of helium with the help of the somewhat simpler 
first-order perturbation theory of Sec. 31; the symmetry effects of the 
spins of the two electrons are taken into account, although spin-dependent 
forces are neglected. We use products of hydrogenic wave functions unin 
(with Z = 2) as the unperturbed eigenfunctions of the problem and are 
interested in classifying the states according to symmetry and spin prop- 
erties rather than in obtaining accurate energy levels. 

In spectroscopic notation, the ground state of helium is the 1s? state: 
Both electrons are in the hydrogenic state uioo. Since this space state is 
symmetric, the spin state that multiplies it must be the antisymmetric 
singlet given as the last of the functions (41.6), for which the total spin 
is zero. 

The space part of the first excited state of helium is eightfold degen- 
erate in the zero-order approximation. The spectroscopic configurations 
are 1s2s and 1s2p. Apart from electron exchange, the first state is non- 
degenerate and the second is triply degenerate (because of the three 2p 
states); the exchange degeneracy doubles the number of states, since 
either electron can occupy the 1s state and the other the 2s or 2p state. 
In order to simplify matters, we consider here only the doubly (exchange) 
degenerate 1s2s state; it is not difficult to show that the 1s2p states can 
be treated separately (see Prob. 7). 

The perturbation energy is the electrostatic repulsion between the 
electrons e¢?/ri2, and the unperturbed states are wy00(f1)U200(r2) and 
U100(f2)Ue2o0(F1). The spin need not be considered explicitly at this point 
since the spin-dependent forces are neglected; appropriate spin functions 
will be multiplied in later to make the entire wave function antisym- 
metric. The matrix of the perturbation for these two states has the 
structure obtained from Eqs. (31.15) and can be written 


- a . (41.7) 


where 


2 
J = ff Uxo0(T1) Udo (Te) ae U100(T1) U200(T2) d*r; d®re 
Ti12 

i (41.8) 
K = iS if Utoo(T1) USo0(T2) — Uioo(F2)U200(T1) d?r1 dre 
Ti2 


J is often called the direct or coulomb energy, and K the exchange energy. 

Application of the diagonalization technique of Sec. 31 (see the treat- 
ment of the first-order Stark effect in hydrogen) shows that the eigen- 
values of the perturbation (41.7) are J + K andJ — K; they correspond 
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to the normalized eigenfunctions 2—![w100(¥1) U200(F2) + troo(F2)U200(F1)] and 
2-4 u100(81) U200(T2) = U100(T2) U200(F1)], respectively. Since the first of 
these is a symmetric space function, it must be multiplied by the anti- 
symmetric singlet spin function. Similarly the second, which is an anti- 
symmetric space function, must be multiplied by one of the symmetric 
spin functions that make up the triplet in (41.6). Since K turns out to be 
positive, the singlet spin state has a substantially higher energy than the 
triplet spin state. This is due not to a spin-dependent interaction but 
to a coupling between the spins and the electrostatic interaction that is 
introduced by the exclusion principle (use of antisymmetric wave func- 
tions). From a physical point of view, the exclusion principle causes the 
electrons to be in different space states if they have parallel spins. They 
then tend to keep away from each other, and this reduces the electro- 
static repulsion between them and hence lowers the energy. 


SPIN FUNCTIONS FOR THREE ELECTRONS 


In the treatment of exchange scattering from helium given in Sec. 43, we 
shall require eigenfunctions of the total spin of three electrons that are 
analogous to those given in Eqs. (41.6) for two electrons. We can regard 
three electrons as 1 + 2 electrons, in the sense that we can combine an 
electron (s = 4) with the triplet two-electron function (s = 1) and with 
the singlet function (s = 0). In the first case, the results on addition of 
angular momenta, given in Sec. 28, show that we should get two groups 
of three-electron spin functions that correspond to s = 4 and s = §; in 
the second case we should get a single group that corresponds to s = 3. 
We thus expect one quartet group of spin states (s = $) and two distinct 
doublet groups of spin states (s = 4), or a total of 4 + 2 + 2 = 8 individ- 
ual three-electron spin states. These must be expressible as linear com- 
binations of the 2? = 8 products of one-electron spin functions. 

Again, these combinations are obtained from Eq. (28.1), with the 
Clebsch-Gordan coefficients now being given by (28.13); the two-electron 
functions required (for electrons 2 and 3) are those of (41.6). We thus 
obtain 

(Si + S2 + S;)? Siz + So: + Sas 


(+++) 1S $n 

34(++—-) +(4+—-4+) +(-4+4)] ASH? An 

i te et a ol Cee er oe Ge ee) 15H? —2h 

(Hae) 15H — 3h 
eH — +4) — (44) -— (4-4) an a «EY 

64(—-+—-) +(--+) - &A+--)l Ftd — gh 

AES) Ch H+] an 1A 


24(—+-) -—(--4) 342 ih 
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The orthonormality and the indicated eigenvalues can be verified directly. 
The first four (quartet) states are symmetric in the interchange of any 
pair of particles. The division of the four doublet states into two pairs is 
such that the first pair is symmetric in the interchange of particles 2 and 3, 
and the second pair is antisymmetric in 2 and 3. The symmetry with 
respect to interchanges of the other two pairs is characterized by the2 x 2 
matrices mentioned below Eq. (40.7); these matrices operate on either 
pair of doublet spin states that have the same m value (see Prob. 8). 


427]DENSITY OPERATOR AND DENSITY MATRIX 


The theoretical development thus far has dealt entirely with systems in a 
pure quantum state which is represented by a single ket |a). It often 
happens, however, that our knowledge of the state of a system is incom- 
plete; for example, we may be able to say no more than that the system 
has nonnegative probabilities pa, ps, . . . for being in the states |a), 
|8), . . .. This incompleteness may be unimportant, in which case the 
theory of pure states is adequate; otherwise, a statistical approach is 
necessary.'. This can be developed in close analogy with the classical 
situation. A pure classical state is one that is represented by a single 
moving point in phase space, that has definite values of the coordinates 
qi. ° -: gy and their canonical momenta pi - - - p; at each instant of 
time. A statistical state, on the other hand, can be described by a non- 
negative density function p(q: - - - gy, 1°: - py, t) such that the proba- 
bility that a system is found in the interval dq, - - - dp; at time ¢ is 
pdqi- ++ dps. The quantum analog of the classical density function is 
the density operator or its representation as a density matrix. Before 
introducing it, we discuss an alternative description of pure quantum 
states. 


EXPECTATION VALUE AND PROJECTION OPERATOR 


The expectation value of an operator © for a discrete normalized pure 
state |a) is (a|Q|a). It may be written in terms of any complete orthonor- 
mal set of kets |2) as 


(a|Qla) = (ale) @lj)Jla) = (¢|2\7)(7| Pal?) (42.1) 


Here P, = |a){a| is the projection operator for the single state a, defined 
as in Eq. (23.36); the generalized summations over 1 and j are omitted in 
accordance with the convention (23.33). The right side of (42.1) can be 
written (2|QP,|2) or (j|P.2|7); we thus have the relations 


(a|Qla) = tr (QP) = tr (P,Q) (42.2) 
1 For a general review, see U. Fano, Rev. Mod. Phys. 29, 74 (1957). 
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As remarked in connection with Eqs. (23.40), Pa is hermitian, and 
| ae tr (P.) = 1 (42.3) 


It is apparent that P. contains the same information as |a), except for an 
overall multiplicative phase factor which does not enter into the calcula- 
tion of expectation values. 

We shall adopt the Schrédinger picture throughout this section and 
omit the subscript S. ‘Then the equation of motion for P,, can be obtained 
from Eqs. (24.1) and (24.2) 


: {ad . d 
in P. = th € o>) (a| + thle) at (al) 


H\a){a| — |aXol|H = [H,Pal (42.4) 


Equation (42.4) resembles the Heisenberg equation of motion (24.10), 
except for a sign; however, the resemblance is misleading, and it must be 
remembered that P. is being viewed in the Schrédinger picture. Indeed, 
in the Heisenberg picture |a) and hence also P. are constants. . 


DENSITY OPERATOR 


As remarked above, we shall assume that a system that is not necessarily 
in a pure quantum state can be described by a set of probabilities pa, 
ps, - - - for being in the states |x), |8), . . - » With random phase differ- 
ences between their amplitudes.! This is called a statistical state. A pure 
state is then a special case of a statistical state, in which one of the pa’s 18 
equal to unity and all the others are zero. 

It is convenient to assume that the states a are orthonormal, 
although not necessarily complete, so that 


PaPs = SapPa pa = 1 Da = 0 (42.5) 
We now define a hermitian operator p that corresponds to this set of 


probabilities 
P= Y PaPa = Y \a)pate| (42.6) 


It then follows from (42.2) and (42.5) that the average expectation value 
of 2 that corresponds to these probabilities is 


(D)av = > Pa(a|Q|a) = tr (Qp) = tr (pQ) (42.7) 


1 For a discussion of statistical ensembles and of the relation of.the density operator 
and matrix to statistical thermodynamics, see Huang, loc. cit. 
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It is also apparent that, in analogy with Eqs. (42.3), 


tr (p) = 1 tr (0?) = Vp? <1 (42.8) 


where the last equal sign can hold only if the state is pure. 


EQUATIONS OF MOTION 


The equation of motion of p in the Schrédinger picture, which is being 
used here, is easily obtained from Eqs. (42.4) and (42.6): 


ad 
ih 7 = [Hyp] (42.9) 


Use has been made here of the fact that the p, are constant in time. This 
is evidently true for an isolated system and is also true in general if H is 
designed to include external influences. Again, the resemblance to the 
Heisenberg equation of motion (24.10) is misleading since p, like P, above, 
is being viewed in the Schrédinger picture. As with P,, p is constant in 
the Heisenberg picture; this can also be verified by transforming (42.9) 
from the Schrédinger to the Heisenberg picture. 

The equation of motion of (),y is also of interest since it is some- 
times what is observed in an experiment. It follows from Eqs. (42.7) and 
(42.9), when it is remembered that the Schrédinger picture is being used 
for 2 as well as for p, that 


inf Qe. = th tr (3 0) + tr (Q[H,p)) 


tr IG - fs io,m1) | (42,10) 


The classical equation that corresponds to (42.9) is known as 
Lnouville’s theorem;} it may be written in either of the forms 


Le OB 4775) (42.11) 


which are equivalent to each other on account of Eq. (24.22). Once 
again, Eqs. (42.11), taken together with (24.23), agree with the quantum 
results that p is constant in the Heisenberg picture. 

The classical analog of (Q),, defined by (42.7) is 


My = fers fAqr- ++ pears: + ps thdqgi-++ dps; (42.12) 
so that the integral over phase space is the classical analog of the quantum 


1 Huang, op. cit., p. 76. 
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trace. Its time derivative is 


me = fof F p+ act) dq - > - dp, 


2 fo { F ends dq: - + dp; (42.13) 


Es ff G+ [,H}) das "+ + dps 


Equation (42.11) has been used in going from the first to the second line 
of (42.13), and partial integration with respect to qi - - - py has been 
used in going from the second to the third line. The agreement between 
Eas. (42.10) and (42.13), when (24.23) is used, seems surprising when the 
earlier remarks concerning our use of the Schrédinger picture are recalled. 
However, this agreement is to be expected, since the numerical values of 
matrix elements such as (a|Q|a), from which (&),, is computed, are inde- 
pendent of the picture. 


PROJECTION OPERATOR FOR A SPIN 3} PARTICLE 


The foregoing formalism is conveniently illustrated with a spin } particle, 
since in this case a pure state can be specified by only two parameters. 
Such a particle in the state |a) can be represented by a normalized spin 
wave function, or spinor, that is a generalization of the v(4) given in (41.4): 


awe = (al = [at,a% (42.14) 
(ala) = |or|? + lal? = 1 


The four parameters associated with the two complex numbers au, a2 are 
reduced to two by the normalization condition and the irrelevance of the 
overall phase. The generalization from v(4) to |a) consists merely in a 
rotation acting on the ket, which can be chosen so as to bring one into 
the other (see Prob. 12). For this state, the projection operator P, is the 
2 X 2 matrix 


|a|? aas 
= joel = [2] tat.oz1 = [lenf, mat (42.15) 
It is easily verified that tr (P.) = 1 and that P,? = P,, as expected. 
Any hermitian 2 X 2 matrix can be written as a linear combination 
of the four linearly independent hermitian matrices 1, o:, oy, a, [see the 
discussion of Eq. (27.32)]. We write this combination in the form 


Py = ao +a:6 (42.16) 
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where the four numbers @o, @z, Gy, a, are real. Since each component of 6 
has zero trace, the condition that tr (P.) = 1 means that a = 3%. We 
also have P,? = ado? + 2aa-¢ + (a:+6)?. This may be simplified by 
noting, with the help of the first of Eqs. (41.5), that (a- 6)? = a%. Then 
the requirement that P.? = P. means that a? = 7. We can thus write 
P,, in the form 


(42.17) 


Py = 4(1 +22" 9) = | ote -| 


Taz i Way ds Tas 


where x, is a real vector of unit length. It also follows from (42.2) and 
(42.17) that 


(aléla) = tr (6P.) = $ tr (6) + § tr [e+ 6)6] = me (42.18) 


Comparison of Eqs. (42.15) and (42.17) shows that a pure state of a 
spin } particle can be completely specified in terms of a single unit vector. 
The two parameters referred to above are now the polar angles of this 
vector. We also see from (42.18) that x. has the physical significance of 
being the expectation value of the spin operator, and so we call it the 
polarization vector that corresponds to the state a. 


DENSITY MATRIX FOR A SPIN } PARTICLE 


It is not difficult to see that the pure state |8) that is orthogonal to |a) has 
the polarization vector x3 = —ma (Prob. 13). The density matrix that 
corresponds to the probabilities pa, ps may then be obtained from Eqs. 
(42.6) and (42.17): 


P= PaPa + paPs = 3[1 + (pa — Dp) ra + 4] (42.19) 
The average expectation value of 6 that corresponds to this p is 
(8)av = tr (dp) = (pe —_ Pp) Ta (42.20) 


We see then that a statistical state for a spin } particle has a polariza- 
tion vector whose length is less than the unit value that corresponds to a 
pure state. 


POLARIZATION VECTOR FOR A SPIN s:PARTICLE 


The spin angular momentum of a particle with spin s can be represented 
by matrices of the form (27.26) with 2s + 1 rows and columns. A pure 
state |) is then a one-column matrix with 2s + 1 rows, which can be speci- 
fied (apart from normalization and overall phase) by 2(2s + 1) — 2 = 
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4s real parameters. The projection operator analogous to (42.15) is 


Q1 
P,=|aXal =] - [lef >> ofl (42.21) 


Qos41 


Such a hermitian matrix can be written as a linear combination of 
(2s + 1)? linearly independent hermitian matrices. It is convenient for 
one of these to be the unit matrix and for the remaining (2s + 1)? — 1 
matrices to have zero trace [see the discussion of Eq. (27.33)]. The latter 
can be chosen to represent the multipole moment operators that are per- 
mitted for a particle of spin s (see the remark at the end of Sec. 28). For 
example, with s = 4, the remaining 2? — 1 = 3 matrices represent the 
spin or dipole moment operator, and with s = 1, the remaining 3? — 1 = 8 
matrices represent dipole and quadrupole moment operators. 
In the general case, we can put, in analogy with (42.16), 


Pima bases P. (42.22) 


where as before ao, a, @y, a, are real. 4 is now the generalization of the 
Pauli matrices to higher spin and is the appropriate set of matrices (27.26) 
divided by sh. The remainder P%, includes all the higher multipole 
moment matrices and is defined to be orthogonal to 1, ¢ in the sense that 


tr (1P,) = tr (6P,) = 0 (42.23) 


Equation (42.23) is a natural generalization of the situation with spin 3, 
in which the trace of the product of any two of the four matrices 1, oz, 
gy, 7, i8 zero (see also Prob. 14). It follows from (42.22), (42.23), and 
Prob. 15 that 

tr (Ps) = (28+ Dag=1 me = tr oP.) = ETVESTD, 

(42.24) 

where =, is the polarization vector. 

The transition from a pure state to a statistical state is effected as 
before and leads to the polarization vector 


(é)av = tr (6p) = ) pata (42.25) 


where the summation is over 2s + 1 orthonormal states. 


PRECESSION OF THE POLARIZATION VECTOR 


As a simple example of the equation of motion (42.10), we now consider 
the precession of a spin s particle in a magnetic field. We assume that 
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the particle has the magnetic moment operator 
u = 7S = yshe (42.26) 


where 7 is called the gyromagnetic ratio. The hamiltonian that describes 
the interaction of the spin with a magnetic field H is then 


H = —u-H = —yshé-H (42.27) 


The rate of change of the polarization vector is given by Eq. 
(42.10): 


& (@)ee = 4 tr ((6,Hle) (42.28) 


since 6 has no explicit dependence on the time. The angular momentum 
commutation relations (27.15), together with (42.26), give 


éxé= <6 (42.29) 


Thus, from (42.27) and (42.29), we obtain for a typical component of 6: 


[o,,H) = —~ysh{{o2,0y)Hy + [o2,02]H:z) 
= —yiR(oHy — ovHs) = vih(6 x H)- 


Substitution into (42.28) then gives 
© (aev = ¥ tr [(6 x H)p] = ¥(6)ev X H (42.30) 


Equation (42.30) is just the classical equation for the precession. This 
agreement between quantum and classical equations of motion can also 
be justified by an argument based on Ehrenfest’s theorem.! 


43LJREARRANGEMENT COLLISIONS 


The calculation of the elastic and inelastic scattering of an electron from 
a hydrogen atom was discussed in Sec. 38, with neglect of exchange 
between the incident and atomic electrons. Electron exchange is a 
special case of a rearrangement collision, in which the component parts 
of the colliding systems are redistributed during the scattering process. 
We first set up the formalism for the general case and then apply it to 
electron-atom scattering. The identity and spin of the incident and 
atomic electrons introduce additional effects that must be taken into 
account. As examples, we shall discuss exchange collisions of electrons 
with hydrogen and helium atoms by means of the Born approximation. 


1 F, Bloch, Phys. Rev. 70, 460 (1946). 
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NOTATION FOR REARRANGEMENT COLLISIONS 


The collision of two systems a and b may result in the same final systems 
in their original states (elastic scattering) or in different states (inelastic 
scattering). Or it may result in different final systems that are formed 
by rearrangement of the component parts of aand b. We shail consider 
the reaction a + b— c+ d and not take identity and spin of the parts 
into account until later. The full hamiltonian can be written in either 
of two ways: 


Here H.» describes the internal and kinetic energies of a and b, and Hi, 
their interaction; H.q and H a are defined similarly. We shall assume 
that all the H’s that appear in Eq. (43.1) are hermitian; otherwise it is 
likely that H. or H.a will have absorptive parts, and there will be no 
stationary states for some of the systems a, }, c, d. 

We shall be especially interested in exact solutions xa+ and x-a~ of 
the full hamiltonian that correspond to total energy E and satisfy Lipp- 
mann-Schwinger equations that are obvious generalizations of (37.14): 


Xat = Ua + (FE — Ha + tc) Hiyxa0* 


. , (43.2) 
Xed~ = Uca + (HE — Hea — te) A gXea 
where 


(Hw — E)use = 0 (Hea — E)uca = 0 (43.3) 


ALTERNATIVE EXPRESSION FOR THE T MATRIX ELEMENT 

The 7 matrix element that describes this collision cannot be written in 
analogy with the earlier work, since in the present case there is not a 
unique interaction V. We therefore go back to the original definition 


(36.32) for the S matrix element and rewrite it in the stationary case 
with the help of (37.1): 


(B|(S = 1)|a) = ($s, (Wat = Ga)) tm, 
= (up, (Xe* = Ug) eC!) Ep Ea)? s (43.4) 
Also the first of Eqs. (87.2) is 


(a\(S — la) = — f (B|T|a) [™* g(tesmers #0 dt (43.5) 


since the interaction in the initial state is turned off at T,. We now 
follow the general procedure illustrated by the propagator in (37.14) and 
replace E. by E. + ze on the right sides of (43.4) and (43.5), where the 
limit e— 0+ is understood. This causes the integral in (43.5) to converge 
at its lower limit, so that g(t) can be replaced by unity. Then, with 
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Es = E,, we obtain 
(B|T a) = te(up, (xa — Ua)) (43.6) 


Although this derivation is somewhat heuristic, it is easily verified 
that (43.6) agrees with the second of Eqs. (37.2) in the simpler case con- 
sidered earlier (see Prob. 17). For our present purpose, Eq. (43.6) has 
the advantage of not involving the interaction explicitly and hence being 
applicable when the interaction is different in the initial and final states. 


T MATRIX ELEMENT FOR REARRANGEMENTS! 


We can now start from the expression (43.6) that is appropriate for a 
rearrangement collision and derive from it a form that is similar to (37.2). 
We first introduce E — H.g + te operating on the right member of the 
inner product, and its reciprocal adjoint on the left member: 


(cd|T|ab) = te(tea, (xav+ — Uad)) 

te((E — Hea — t€)~ ca, (E — Hea + 2€)(xavt — tas)) 
The second of Eqs. (43.3) shows that the left member of the inner product 
is now (—7e)~!ueg; since the complex conjugate of the left member is to 
be taken, the factor (—ze)~! cancels the factor ze preceding the term 


in brackets. Equation (43.1) can then be used to replace H.g by 
H.» + Hi, — Hi, in the right member of the inner product: 


(cd|T\ab) = (tea, (E — Hap + te + Hig — Hoy) (xas* — tas)) 
= (uca, (E — Has + t€)(xavt — Uan)) 
+ (tea, (Hg — Hin) x0s*) — (Ue, (Hcg — Hey) as) (43.7) 
Substitution for xa+ — ue from the first of Eqs. (43.2) shows that the 
first term on the right side of (43.7) is equal to (uca,H%xast), so that it 


cancels part of the second term. Further, the last term on the right side 
vanishes since 


(tea, (Hig — Hy) ar) = (tea, (Has — Hea) tas) 
a (tea, H abUar) —_ (H eatica;Uad) = 0 (43.8) 


with the help of (43.3). 
We thus obtain 


(cd|T |ab) = (teasH¢axas*) (43.9) 


which is similar in structure to (37.2). An equivalent expression in which 
the roles of initial and final states are interchanged can be obtained 
either by starting over with a form analogous to (43.6) or by manipulating 


1B. A. Lippmann, Phys. Rev. 102, 264 (1956); E. Gerjuoy, Ann. Phys.(N.Y.) 5, 58 (1958). 
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(43.9): 
(cd|T ab) = (xed~)Hatat) (43.10) 


Equations (43.9) and (43.10) are usually referred to as the post and prior 
forms of the 7 matrix element, respectively, since the first involves the 
interaction in the final state and the second that in the initial state. 
These two equations show incidentally that the reaction does not occur 
if there is no interaction in either the initial or the final state, as would 
be expected. 

As in Sec. 38, the Born approximation consists in using (43.2) to 
replace xast by ua in (43.9) or xed~ by weg in (43.10). The two expressions 
obtained in this way are equal on account of (43.8): 


(cd|T|\ab)s ar. (ued, H ,qttab) = (UeayH 4yUod) (43.11) 


PRESENCE OF A CORE INTERACTION 


A situation of particular interest occurs when part of H 7» and Hi, arises 
from the interaction of one of the colliding systems with an infinitely 
massive core in the other. An example is provided by the exchange scat- 
tering of an electron by an atom, in which the atomic nucleus is so massive 
that it can be regarded as such a core of infinite mass. We have already 
seen in Sec. 38 that, in nonexchange scattering, the interaction between 
the incident electron and the nucleus cannot lead to excitation of the atom 
(inelastic scattering), because of the orthogonality of the initial and final 
wave functions of the atomic electron. In somewhat similar fashion, we 
might expect that exchange scattering can be caused only by interaction 
between the incident and atomic electrons that exchange places and not 
by interaction of either electron with the nucleus. Yet Eqs. (43.9) to 
(43.11) do not suggest in any obvious way that the contribution of this 
core interaction to the 7 matrix element is zero. We now show rather 
generally how the core interaction can be eliminated.’ 
The reaction we consider may be represented schematically as 


1 + (2, core)a—> 2 + (1, core), 


where, for example, the first parenthesis indicates that 2 is bound to the 
core in state a. The full hamiltonian is 


Hee Ke Us Ua Vix (43.12) 


where K; and K; are the kinetic energy operators for 1 and 2, U; and U2 
are the interactions of 1 and 2 with the core, and Viz is the interaction of 
1 We follow the derivation given by T. B. Day, L. 8. Rodberg, G. A. Snow, and J. 


Sucher, Phys. Rev. 128, 1051 (1961). The extension to the case in which the core is 
not infinitely massive is discussed in footnote 6 of this paper. 
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1 and 2 with each other. The quantities defined in connection with Eq. 
(43.1) are then 


HAa= Ki+ K.+ U, Hi, = Ui+ Vie (43.13) 
Hea = Ki+ K2+ U, Hi, = U2t+ Vie 
We now call the exact initial and final solutions xa+ = x;+ and xea~ = xy7, 


and the unperturbed initial and final solutions wa = ue(1)we(2) and 
Uca = We(1)ug(2). Here wa and w, are the core-bound initial and final 
wave functions that satisfy the equations 

(Ke + U2 = €a)Wa(2) = 0 (Ky + U;, = €) w,(1) = 0 


43.14 
€a, & < 0 ( ) 


and uz and ug are the corresponding free wave functions that satisfy 
(Ki — E + e)ua(1) = 0 (Ko — E + @)us(2) = 0 (43.15) 
The “‘post’’ form of the exact T matrix element (43.9) is then 


(cd|T ab) = [weug, (U2 + Vie)x:*] (43.16) 


ELIMINATION OF THE CORE TERM 


Our objective is the elimination of the U2 term in Eq. (43.16). The only 
general way in which to accomplish this is to rewrite the matrix element 
so that U. appears only between exact states of 1 that correspond to 
different energies; then the orthogonality of these states, together with 
the fact that U. does not depend on the coordinates of 1, will cause this 
term to vanish. Thus we shall require, in addition to the exact core- 
bound state w,(1) that corresponds to energy ¢, the exact core-scattering 
state x.+(1) that is analogous to ue(1) and corresponds to the energy 
E — «&. We therefore introduce the equations 


(Ki +Ui-—-E+ €a)Xat (1) = 0 
(Ke + U2 — EB + &)xg-(2) = 0 (48.17) 


Lippmann-Schwinger equations of the form (37.14) for x;+ and of 
the form (37.16) for xs~ are the only ones needed if we start from the 
‘“‘nost”’ expression (43.16): 


xt = xat(1)wa(2) + (HE — Ki — Ke — U1 — U2 + te)~ Vi axit 
(43.18) 
x-(2) = ug(2) + (E — & — Ky — Us — 6)—U gu (2) (43.19) 


Equation (43.19) can be multiplied through by w,(1), which commutes 
with (EH — e — Kz — U2 — %)—'U; since this operator does not involve 
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the coordinates of 1: 


we(1)xp7 (2) = we(1)u,e(2) 
+ (EF —e — K,— U2—- te)~!U gun(1)ug(2) (43.20) 


At this point we must make the assumption that the core is infinitely 
massive, since we shall soon have to make use of the fact that the inverse 
operator in (43.20) can be the adjoint of that in (43. 18). The two are 
adjoints if we can use the second of Eqs. (43.14) to replace ew,(1) by 
(Ki + U1)w(1) in (43.20), and this replacement is valid only if Ki + Ui 
commutes with U,. Suppose now that the core has finite mass. Then 
we must work in the center-of-mass coordinate system, and the total 
kinetic energy operator cannot be separated into parts Ki and K; that 
commute with U, and U,, respectively. This corresponds physically to 
the fact that the recoil of the finite-mass core under the impact of 1 can 
shake off 2, so that the interaction with the core does, in fact, contribute 
to exchange scattering. 

With a core of infinite mass, we can replace (43.20) by 


we(1)xe~(2) = we(1)ue(2) 
+ (E — K, — K.—-— Ui — U2z— te)—1U guy (1) ug (2) (43.21) 


Equation (43.16) can then be rewritten in a series of steps that make use 
of some of the foregoing relations: 


(cd|T|ab) = (weus,U2x:+) + (wexe7)V r2xs*) 
— [(E — Ki — Kz — Ui — U2 — te) U 2woug, Viex:*] 
= (weg, Uoxs*) + (wexe-,Vi12x:*) 
= [weus, UAE _ K, _ Ke = U, —< U, + te)! V 10x*] 
= (weg, Uoxit) + (wexe ,Viexs*) 
— [weug, Us(xit — xatwa)] 
= (wexs,Viexit) + (wows, U2xat Wa) (43.22) 
The second term on the right side of (43.22) vanishes because of the 
orthogonality of w(1) and x.*(1), as was remarked at the beginning of 


this subsection. We thus obtain for the ‘post’ form of the exact T 
matrix element 


(cd|T|ab) = [we(1)xs~ (2), Viexs*] (43.23) 
In similar fashion, the “prior” form can be written 
(cd|T|ab) = [xs~, Vi2xet(1)wa(2)] (43.24) 


These expressions show at once that the process does not occur if there 
is no interaction between 1 and 2. 
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Equations (43.23) and (43.24) can be used to obtain a distorted 
wave Born approximation expression for the T matrix element, which is 
of first order in Vi. but exact in U; and U2. This is accomplished by 
replacing x,+ by xat(1)wa(2) from (43.18) or xs~ by we(1)x,s~(2): 


(cd|T|ab)pwaa = [we(1)xe~(2), Viexat(1)wa(2)] (43.25) 


The Born approximation is obtained by replacing xg~(2) by ug(2) from 
(43.19) and xat(1) by ue(1): 


(cd|T|ab)s = [we(1)ug(2), Virte(1)wa(2)] (43.26) 


EXCHANGE COLLISIONS OF ELECTRONS WITH HYDROGEN 


As a first example of a rearrangement collision in which effects of identity 
and spin appear, we consider the elastic scattering of an electron from a 
hydrogen atom. Ina problem of this type, we must know the asymptotic 
forms of the unsymmetrized wave function for all permutations of identi- 
cal particles.. A wave function that has the proper symmetry can then 
be constructed by the methods outlined in Secs. 40 and 41. 

Since there are two electrons, we require asymptotic forms for the 
exact unsymmetrized wave function x;*+(r1,r2) when 7: is large and also 
when fr, is large. In the first case, the asymptotic behavior is of the 
general form (38.18): 


xt (11,52) TERE Clexp (Ka = T1) ss ry tetkarif py (k,,Ka)|wa(ts) fs aa 
(43.27) 


Here fp is the direct or nonexchange elastic scattering amplitude, for 
which the incident electron is scattered and the atomic electron is left 
in its original state; the dots represent other terms in the series of (38.18) 
and correspond to excitation of the atomic electron. As in (38.18), k, is 
in the direction of r1. The asymptotic behavior when 7’, is large has the 
form 


xt (11,12) = Cro lettarsfin(Kk,,Ka)Wa(¥1) + °° (43.28) 


Here fz is the exchange elastic scattering amplitude, the dots again repre- 
sent atomic excitation, and k, is now in the direction of r.. There is no 
plane wave state in (43.28), since wa(rz) vanishes when 7, is large. 

As in Secs. 40 and 41, we want to work with the symmetric and 
antisymmetric combinations x;+(r1,r2) + x:+(r2,r1) and find their asymp- 
totic forms as r; or r2 becomes infinite (because of the symmetrization, 
either one will do). Our immediate problem, then, is to express the 
direct and exchange amplitudes fp and fg in terms of the corresponding 
T matrix elements, which we know. The exact direct 7 matrix element 


1 J. R. Oppenheimer, Phys. Rev. 32, 361 (1928). 
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is, from (38.21), 


: ] 
Tp = |C|?e? i exp (—tkg + 11) wz (2) (=. — ) xit (11,82) dri d*re 
(43.29) 


Similarly, the “post” form of the exact exchange T matrix element is 
obtained from (43.23): 


Te = \C\’e? ff we (t1)xe~ *(r2) os xit(11,02) d*ri d*re (43.30) 
T12 
We already know the relation between fp and Tp from (37.21) or (38.21): 
ae 
fo=- hICE Tod (43.31) 


Although it seems likely that the relation (43.31) also holds between 
fz and Tz, it is not obvious that this is the case, and we shall give a proof 
in the next subsection. Before embarking on this, however, it is inter- 
esting to compare the structures of Eqs. (43.29) and (43.30), both of 
which are exact. The expression for Tp is a matrix element of the full 
interaction, including the core term —e?/r, and with the free final state 
exp (ikg- 11) for the outgoing electron. We might have written 7T'z in 
an analogous form, by making use of (43.16). However it is preferable 
for subsequent perturbation approximations to express Tg as a matrix 
element of only the interelectron interaction e?/r12; the electron-nucleus 
interaction is then taken into account through the use of the exact core- 
scattering wave function xg~(rz) for the outgoing electron. 


RELATION BETWEEN AMPLITUDE AND MATRIX ELEMENT 


We now show that the exchange scattering amplitude and 7 matrix 
element, fz and Tx, are related by (43.31). The interaction is different 
in the initial and final states of a rearrangement collision; thus it is 
advantageous to start with the expression (43.6) for the T matrix element 
‘since it does not involve the interaction explicitly. As a check on the 
calculation, we first rederive Eq. (43.31) and then show how thé deriva- 
tion can be extended to the exchange case. 

We wish then to evaluate the integral 


Tp = tef[C* exp (—7ks- 11) wa (to) [xi* (t1,82) 
— C exp (ika +11) Wa(f2)] d*rid'rz (43.32) 
where the limit ¢— O+ is understood. Since « multiplies the integral, 
only terms that are proportional to «~' are of interest. The contribution 


to the integral from finite values of 7; is finite; hence this part of Tp 
vanishes in the limite > 0. Thus we can use the asymptotic form (43.27) 
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for x,:+(r1,%2). The excitation terms that are represented by dots can be 
omitted since w’s that correspond to different states are orthogonal and 
vanish on integration over fo. 

The bracket in (43.32) is then proportional to wa(re2), so that the 
integration over r2 gives unity. We drop the subscript from r, and write 


Tp = te|C|2fr-} exp (—7Kg Le 9 + tke’) fo(K,,Ka) d'r (43.33) 


As in the derivation of (43.6), we replace EZ, by E. + ie or, equivalently, 
ka by ka + tem/h?k,. It is convenient to choose the direction of ks as the 
polar axis for the integration and to denote the polar angles of r or k, with 
respect to this axis by 6, ¢. With w = cos @, fp is some function of w 
and ¢, and we may write 


Tp = telC|? f r! exp | - he r+ tkar(1 — u) | fotw,9) d*r 
(43.34) 


since kg = kg. 

It is most convenient to perform the w integration first by using the 
partial-integration or stationary-phase method discussed in connection 
with Eq. (20.14): 


ii fo(w,d)etart—w) dy = aa [fo(1,@) — fo(—1,¢)e**e"] 


os iG ie ip ear») dy (43.35) 
On substitution into (43.34), the first two terms on the right side of Eq. 
(43.35) make contributions to the r integration that are proportional to 
e—' and [e — (27h?k,?/m]-', respectively. The integral on the right side 
of (43.35) acquires higher powers of r in the denominator on further 
partial integrations, so that its leading contribution to the r integration 
is of order In «. Further, fp(1,¢) is evidently independent of ¢ and is 
equal to fp(ks,k.). We thus obtain, on taking the limit « — 0+, 


Tp = - 2ehicl fo(Ke,Ka) (43.36) 


This is the same as (43.31). 

The exchange T matrix element can be calculated in similar fashion, 
starting from (43.6). Instead of (43.32), we now wish to evaluate the 
integral 

Tg = tef fw3(t1)C* exp (— akg - r2)[xi+(11,r2) | 

— Cexp (tk. + 11)wa(r2)] d?r; dr, (43.37) 
where again the limit «—» 0+ is understood. The contribution to the 
integral from finite values of r2 is finite; hence this part of Tz vanishes in 
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the limit «— 0. We can therefore neglect the second term in the bracket 
and use the asymptotic form (43.28) for x:+(t1,r2); the excitation terms 
represented by dots can again be omitted. The r, integration gives 
unity, and we drop the subscript from rz to write 


Tr = t\C|?fr-! exp (—tkg-r + tkar) fa(K,,Ke) dr 
This is evaluated in exactly the same way as (43.33) and leads to 
_ 2rh?|C]? 


m 


Tr = fn (ke,Ka) (43.38) 


EFFECTS OF IDENTITY AND SPIN 


We shall assume, as in the discussion of identical particle collisions in 
Sec. 41, that the interaction does not depend on the spin. Then all we 
need do in order to take into account the identity and spin of the two 
electrons is to form an antisymmetric wave function from products of 
x:+(r1,%2) and appropriate spin functions. The spin functions can be 
taken to be the set of four given after Eq. (41.5); however, it is simpler 
to make use of the four symmetrized combinations (41.6). The spin of 
the incident electron is not assumed to have any definite relation to 
the spin of the atomic electron. In this case we can use either of these 
sets of spin functions, calculate the scattering with each of the four. 
spin states of a set, and then average the results with equal weights 
for each state.1 The first three of the spin functions (41.6) are sym- 
metric and must be multiplied by the antisymmetric space function 
xit(ri,%2) — x:*(r2,61); the fourth spin function is antisymmetric and must 
be multiplied by x;*(r,re) + x+(r2,81). 

' The asymptotic forms of the symmetrized space functions for large 
values of one of the electron coordinates, say 71, are obtained from (43.27) 
and (43.28): 


xit (¥1,F2) + xi+(fe,81) ae C{exp (ike *11) 
+ ry) e*an[fp(0) + fr(8)|}walte) + --- (43.39) 


The dots represent atomic excitation, and 6 is the angle between r: and 
k,. The differential cross section must be computed with the upper sign 
in one quarter of the collisions and with the lower sign in three quarters 
of the cases. We thus obtain 


o(6) = 4\fp(9) + fx(9)|? + t1fo(9) — fa(@)|? (43.40) 


Equation (43.40) can also be derived without explicit reference to 
the spin wave functions, as was Eq. (41.2), by making use of the earlier 


1 This is the fundamental postulate of quantum statistical mechanics referred to in 
the derivation of Eq. (41.2). 
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observation that particles that have different spin components are dis- 
tinguishable. If half the collisions, the electrons have different spin 
components, and the cross section is just the sum |fp(@)|? + |f2(6)|? of 
the direct and exchange cross sections; in the other half, the electrons 
are indistinguishable, and the antisymmetric space function must be 
used. We thus obtain 


o(8) = 3(Ifo(9)|? + |f2(@)|?) + 4] fo(0) — f2()|? 


which is easily seen to be the same as (43.40). 


EXCHANGE COLLISIONS WITH HELIUM 


We now consider the elastic scattering of an electron from a helium atom 
in its ground state and again assume that the interaction does not depend 
on the spin. According to the discussion of Sec. 41, the two electrons in 
the helium atom are in a symmetric space state and an antisymmetric 
(singlet) spin state. We therefore write the partially symmetrized wave 
function that corresponds to incident electron 1 and atomic electrons 2 
and 3 as the product of a space function x;+(ri,r2,r3) that is symmetric 
in r, and r3, and a spin function v(1,2,3) that is antisymmetric in 2 and 3. 
The eight spin functions for three electrons are grouped in (41.9) according 
to their symmetry in 2 and 3; it is apparent that v(1,2,3) must be one of 
the last doublet pair given there. 

A straightforward extension of the earlier work leads to the asymp- 
totic forms 


xi*(F1,02,03) ——> Clexp (ike + 1) + rite*anfp(6)]wa(r1,82) 


——> Cre etha rf (0) wa(ts,01) (43.41) 


rr 


ae Crs tearsfz (8) wa(t1,52) 


where wa is the helium ground-state space function and excited states are 
ignored; 6 is the angle between the asymptotic coordinate and k,. The 
direct amplitude fp is related to T’'p by (43.31), where now 


Tp, = |C|?e? [ff exp (—7kg +11) w* (r2,rs) 


(3. ne — 7) xit(¥1,%2,%3) dr) d'r2 d'r3_ (43.42) 


Tie T13 Ty 


The exchange amplitude fz is evidently the same for the last two of the 
asymptotic forms (43.41). It is related to Tx by (43.38), where now 


Deca If wa (¥1,T2)xe~*(ra) (4 + =) 


113 Toz 


xi*(E1,%2,r3) dr; d'r2 d’rg_ (48.438) 
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The completely antisymmetric wave function for the three electrons 
is easily seen to be 


xi+(11,82,%3)v(1,2,38) + xit (¥2,%3,%1)0(2,3,1) + xit(r3,41,T2)0(3,1,2) 
(43.44) 


We wish now to calculate (43.44) when one of the electron coordinates, 
say 1, is large and the other two electrons are in the ground state of a 
helium atom. This means that we take the asymptotic form in 7; and . 
also project the spin functions onto v(1,2,3); with the help of (43.41), 
this is 
C{exp (ka +01) + rite*n[fn(0) + fe(8)vt(1,2,3)v(2,3,1) 
am fe (0)vt(1,2,3)v(3,1,2)]} wa(t2,r3)(1,2,3) (43.45) 
We take for »(1,2,3) the next to the last spin function of (41.9); then 
vt (1,2,3)0(2,3,1) 
= H4+-)t — (4-4-4 4) — (4+-)] = -4 


and also vt(1,2,3)(3,1,2) = —4. The differential cross section obtained 
from (43.45) is then 
o(6) = |fp(6) — fe(6)|? (43.46) 


Like Eqs. (41.2) and (43.40), Eq. (43.46) can be derived without 
explicit reference to the spin functions. Since the two atomic electrons 
must have antiparallel spins (singlet state) in order for the helium atom 
to be in its ground state, the spin component of the incident electron is 
the same as that of one of the atomic electrons and is different from that 
of the other. It cannot exchange with the latter in an elastic collision, 
since then both the resulting atomic electrons would be in the same spin 
state and the exclusion principle would force the atom into an excited 
state. Thus it can exchange only with the electron with which it is 
indistinguishable, so that the antisymmetric combination of direct and 
exchange amplitudes must be used; this gives Eq. (43.46). 

In the absence of spin-dependent interactions, the excitation of a 
triplet state of helium by electron impact can be accomplished only by 
exchange between the incident electron and one of the atomic electrons. 
In this case, there is no direct amplitude and hence no interference 
between direct and exchange amplitudes. 


PROBLEMS 


1. Find a set of six 2 X 2 unitary matrices that represent the 3! = 6 permutations 
of three objects. 
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2. Show that the antisymmetric wave function given in Eq. (40.11) vanishes if there 
is an identical linear relation between the functions vq, vg, . . . , 0». 


3. Show that, if a wave function u(1,2, .. .,n) is an energy eigenfunction of a 
symmetric hamiltonian that corresponds to a nondegenerate eigenvalue, it is either 
symmetric or antisymmetric. 

4. Make use of Eqs. (41.5) to verify that the spin wave functions given in (41.6) are 
eigenfunctions of (S, + Sz)? and Si, + S2, with the indicated eigenvalues. Show 
also that the result of operating on these functions with the z and y components of the 
total spin is in agreement with the appropriate matrices given in Eqs. (27.26). 

5. Carry through the calculations of Prob. 4 for the spin functions given in (41.9). 


6. Use Eq. (41.2) to derive an expression for the scattering of protons on protons in 
the center-of-mass coordinate system, assuming that the coulomb interaction extends 
in tor = 0. Discuss the classical limit of the cross section (A — 0), particularly in 
the neighborhood of @ = 90°, and show that the interference term drops out if the 
average scattering over an arbitrarily small but finite range of angle is computed. 

7. Show that the 1s2p configurations in helium can be treated separately from the 
1s2s configurations so far as the first-order energy-level calculation of Sec. 41 is 
concerned. 

8. Show how the matrices found in Prob. 1 can operate on either pair of the doublet 
spin states (41.9) that have the same m value, to give the correct symmetry properties. 
9. What would be the unperturbed ground-state wave functions of helium if each 
electron had spin angular momentum # and obeyed Einstein-Bose statistics? 

10. Write down the unperturbed ground-state wave function for a neutral lithium 
atom. 

11. Use the coordinate representation to verify that the second of Eqs. (42.3) is 
valid, that is, that tr (Pa) = 1. 

12. Make use of the answer to Prob. 9, Chap. 7, to find the unitary matrix that 
rotates the state v(%) into the state |a) that has polarization vector x with polar 
angles 6a, ¢a. Then show that the P.’s calculated from Eqs. (42.15) and (42.17) 
agree with each other. : 

13. Show that, if two pure states of a spin 3 particle are orthogonal, the polarization 
vectors for these states are equal and opposite. 


14. Show that the trace of the product of any two perpendicular components of 
6 for a spin s particle is zero. 


15. Evaluate the trace of the square of any component of ¢ for a spin s particle. 

16. Calculate the polarization vector explicitly for an arbitrary pure state of a spin 1 
particle. Show that the length of this vector is less than or equal to unity. Find 
the condition on the state such that the length is equal to unity. 

17. Show explicitly that the expression (43.6) for (8|T'|a) is the same as that given 
in the second of Eqs. (37.2). 

18. Derive Eq. (43.10) by manipulating Eq. (43.9). 

19. Show by direct calculation that Eq. (43.40) is obtained if the incident and 
atomic electrons are assumed to be described by the four spin functions (++), (+—), 
(—+), and (— —), rather than by the triplet and singlet combinations. 

20. Make use of Eqs. (43.30) and (43.38) to obtain the Born approximation for the 
exchange scattering amplitude of an electron from a hydrogen atom. Evaluate this 
in the forward direction, and compare the result with the corresponding direct 
scattering amplitude obtained: from Eq. (38.28). 


T1 
Semiclassical Treatment 
of Radiation 


No account has thus far been given in this book of the interaction 
between material particles and electromagnetic radiation. As would be 
expected, a treatment that is consistent with the foregoing theory of 
material particles requires that quantum equations of motion of the 
electromagnetic field be found that are analogous to Maxwell’s equations. 
Indeed, it is only in this way that Planck’s original quantum hypotheses 
can be fitted into a general theoretical framework. The development of 
the elements of a quantum theory of radiation will be postponed until 
Chap. 14. In the present chapter we treat the electromagnetic field 
classically and the particles with which the field interacts by quantum 
mechanics. Such a semiclassical treatment is bound to be incomplete 
and not wholly satisfactory, although it is simpler in principle than 
the quantum field theory presented in Chap. 14. We shall find that 
it is possible in this approximate way to give a plausible and correct 
account of the influence of an external radiation field on a system of 
particles (absorption and induced emission) but not of the influence of the 
particles on the field (spontaneous emission). Nevertheless, the results 


; 397 


398 QUANTUM MECHANICS 


of the classical treatment of the latter phenomenon can be converted to 
quantum theory in a correct, if not very convincing, manner. Some 
simple applications of the theory are given in Sec. 46. 


44C/ABSORPTION AND INDUCED EMISSION 


The Schrodinger wave equation for the motion of a particle of mass m and 
charge e in an electromagnetic field described by the potentials A, ¢, with 
an additional potential energy V, is obtained by adding a term Vy to the 
right side of Eq. (24.39): 


3 fh A 
nB-[- Ss a. vt hy. A) 


+ 


oe 


A? + ep + v| vy (44.1) 


We regard V as the potential energy that binds the particle (of electro- 
static origin if the particle is an electron); A, ¢ represent an electromag- 
netic field that is weak enough so that those terms can be regarded as a 
perturbation. Our object is to calculate the probabilities of transitions 
between stationary states of the particle in the potential energy V, that 
are produced by the field. We first discuss some properties of the field 
and its plane wave solutions. 


MAXWELL’S EQUATIONS 


Maxwell’s equations of motion for the electromagnetic field are, in 
gaussian units, 


10H _ 10E  4r 
VxE+ oa 0 | VMN Se a (44.2) 
iba. 9g 


If the divergence of the second of these equations is combined with the 
time derivative of the third, we obtain the equation of continuity for the 
electric charge and current densities p and J: 


Vea. = _ (44.3) 


The electric and magnetic field strengths can be expressed in terms of the 
potentials by Eqs. (24.30): 


PS Sa = Ve H=vxA (44.4) 


which cause the first and fourth of Eqs. (44.2) to be satisfied identically. 
The potentials are not defined uniquely by Eqs. (44.4), since any A, ¢ that 
give the correct E and H can evidently be replaced by new potentials A’, 
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¢’ without altering the field strengths, where 


A’=A+4+ Vx Ga ai aa OX (44.5) 

c ot 
and x is an arbitrary function of r and ¢. This change in the potentials 
without changing the fields is called a gauge transformation. It is easily 


shown that ¥ must also be replaced by 
y' = petexitc (44.6) 


if the form of the wave equation (44.1) is to be preserved (see Prob. 3). 
Equations (44.6) and (44.5) are often called gauge transformations of the 
first and second kinds, respectively. 

Substitution of (44.4) into the second and third of Eqs. (44.2) gives 


107A 1 86 _ 4x 
TORN OR aan hg ae 
os (44.7) 
= i= 2 a 
rarred A+ V7 4irp 


If the vector A is written in rectangular coordinates, we can put 
VxVxA=V(V-A)—VWA 

where the last term is the vector whose components are the laplacians of 

the components of A. We can therefore simplify Eqs. (44.7) by making a 


gauge transformation (44.5) from A, ¢ to A’, ¢’ such that the new poten- 
tials satisfy the Lorentz condition: 


,, 109 | 
V-A Bier =0 (44.8) 
The gauge function x then satisfies the equation 
1 d*x 1 0@ 
Digg es nc es . np cs 
Vix — 8 on (v aa i) 
Equations (44.7) then become 
1 3?A’ 4a 
get SIO, ee a oe 
ue c* at? c J (44.9) 
1 a2¢! 
VOD hog oe 


PLANE ELECTROMAGNETIC WAVES 


If J = 0 and p = 0 (completely empty space), it can be shown that it is 
possible to choose the gauge function so that V- A’ = O and ¢’ = 0 for all 
r and t, without loss of generality (see Prob. 1). Then transverse plane 
wave solutions can be found for A’ and hence also for E and H. We drop 
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the primes and have in this case 
_ 1 ata 
c* ot? 
A typical plane wave solution of (44.10) is one that represents a real 
potential with the propagation vector k and the real polarization vector 
|Ao|: 
A(r,t) = 2|Ao| cos (k+r — wt + a) 
= Ay exp [7(k-r — wi)] +¢c. | (44.11) 
Here ‘‘c.c.’’ denotes the complex conjugate of the term that precedes it, 
and the constant complex vector Ay is defined to be |A,le**. The first of 
Eas. (44.10) is satisfied if w = kc, where k is the magnitude of k, and the 
second is satisfied if Ao is perpendicular to k. 


The electric and magnetic fields associated with the vector potential 
(44.11) are 


E = —2k|A)| sin (k-r — wt + a) 
H = —2k x |A)| sin (k-r — wt + a) 


The Poynting vector (c/4r)E x H is evidently in the direction of k; its 
magnitude averaged over a period 27/w of the oscillation is 


VA = 0 V-A=0 (44.10) 


ig? 
2re 


|Ao|? (44.12) 


where |Ao|? is equal to the scalar product of -|Ao| with itself (|Ao| - |Ao|) or 
the scalar product of Ao and its complex conjugate (Ao: Aj). The quan- 
tity (44.12) is the intensity associated with the plane wave (44.11). 


USE OF PERTURBATION THEORY 


We now return to Eq. (44.1) and use it to calculate the probability of a 
transition between stationary states that is produced by the vector poten- 
tial (44.11), which is regarded as a small perturbation. The third term 
(Vv - A) and fifth term (¢) on the right side of (44.1) are now zero. The 
ratios of the second to the first term and the fourth to the second term on 
the right side of (44.1) are of order eA/cp, where p is the momentum of the 
particle. The magnitude of this quantity is estimated in a practical case 
in Prob. 2 and is so small that the perturbation approximation is justified. 
Thus, to the first order of perturbation theory, we can neglect the term 
e?A2/2mc? and rewrite Eq. (44.1): 


ih ce = (Ho + Hv 


a (44.13) 
Moeeve H=“a.y 


Ho = — 3m, Ge 
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We proceed as in Sec. 35 and expand y in stationary eigenfunctions 
u(r) of the unperturbed hamiltonian Ho with time-dependent coeffi- 
cients a,(t). If the system is initially in the state n and the perturbation 
is turned on at ¢ = 0, the first-order amplitudes at time ¢ are given by an 
expression similar to (35.11): 


C1009) aks ee es Ree 
a) h Wkn — w h Wkn + w 
(k|H"|n) = teh ux exp (tk-r)Ao+V un a'r (44.14) 
(k|H'|n) = ich ug exp (—7k- r)Aj- Vu, d'r 


As discussed in Sec. 35, the probability of finding the system in the 
state k is appreciable only when the denominator of one or the other of 
the two terms in (44.14) is practically zero. There is no interference 
between the two terms; the first is important when FE, ~ E, + hw, and 
the second is important when E, ~ E, — hw. Thus the probability of 
finding the system in a state k that has an energy higher than the initial 
state by about fw is proportional to |(k|H’|n)|?, and the probability of 
finding the system in a state k’ that has a correspondingly lower energy is 
proportional to |(k’|H’’°|n)|?. 


TRANSITION PROBABILITY 


The discussion of Sec. 35 shows that the transition probability per unit 
time is independent of the time only if the final state can be any of a very 
closely spaced or continuously distributed group. The need for a group 
of final states arises from the dependence of the probability |a,» (é)|? on the 
energy, which is shown in Fig. 31; it is the area under this curve, not the 
ordinate at a particular absicssa, that is proportional to ¢. 

In the same way, a constant transition probability per unit time is 
obtained in the present problem if the incident radiation is monochro- 
matic (definite value of w) and transitions can occur to any of a group of 
closely spaced or continuously distributed final states. The result is Eq. 
(35.14) with either (k|H’°|n) or (k|H’’°|n) substituted for (k|H’|m). How- 
ever, the computation of a transition probability between two discrete 
states is often of interest. In this case, the transition probability per unit 
time is not constant in time, if the incident radiation is strictly monochro- 
matic, and depends markedly on the difference between w and 

|E, — E,,| 


|wrn| = hh 


What we do in this case is to assume that the radiation covers a spread of 
frequencies with no phase relations between the different frequency com- 
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ponents, so that the radiation. can be characterized by an intensity per 
unit frequency range that is constant in the neighborhood of |wsn|.1 

The probability of finding the system in the final state is then 
proportional to |(k|H’°|n)|? or |(k|H’’°|n)|?, which in turn is proportional to 
|Ao|2 and hence to the intensity. If the intensity in the small angular 
frequency range Aw is I(w) Aw, Eq. (44.12) tells us that we can put 


2 
|Ao|? = TF I(w) dw (44.15) 


where Ay is the vector potential amplitude that characterizes the fre- 
quency range Aw. The probability that a transition in which the system 
is left in a higher energy state (LE, ~ E, + hw) has taken place at the 
time ¢ is then 


AW(k|H’°ln)!2 sin? 3(wen — w)t 
“rs = Ma = 


w 


_ Sire? x 2 3 2 
= > ar Tw) Aw| f ut exp (tk+ 1) Vaun ar 
sin? 4(win — w)t 

(Win — w)? 


(44.16) 


where V4 is the component of the gradient operator along the polarization 
vector Ay. The contributions to the probability from various frequency — 
ranges are additive, since there are no phase relations between the radia- 
tion components of different frequencies. 

Each frequency range Aw in (44.16) can be made infinitesimally 
small, and the summation replaced by an integration. Since the time 
factor has a sharp maximum at w = wen, the other factors that involve w 
ean be taken outside the integral and the limits on w extended to + ©, as 
was done in going from Eq. (35.13) to (35.14). Thus the transition prob- 
ability per unit time for an upward transition becomes 


1 Sire? 
a qQ) 2h ee 
t lax (2) mCwen? I (win) 


/ ux exp (ik+r) Va un a'r ’ 


[cl sin? 3(wkn — w)b 

-0  t(wen — w)? - 
202 

Ante? T (crn) 


Mm cwkn? 


2 


i ug exp (tk+r) Va Un d*r 


(44.17) 


where the magnitude of k is now win/c. An expression very similar to 
(44.17) is obtained for the probability per unit time of a downward 
1 For a discussion of the situation in which the intensity is not constant near |wz,l, 


see W. Heitler, “The Quantum Theory of Radiation,” 3d ed., sec. 20 (Oxford, New 
York, 1954). 
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transition (Ey, ~ E, — hw): 


4r*e? 
m2 Cw ap? I (wax) 


, ut exp (—ik +1) V4 un dr | (44.18) 
In this case the magnitude of k is w,,+/c. 


INTERPRETATION IN TERMS OF ABSORPTION AND EMISSION 


Equations (44.17) and (44.18) give probabilities per unit time for transi- 
tions of the particle between stationary states under the influence of a 
classical radiation field. These expressions can now be interpreted in 
terms of absorption and emission of quanta of electromagnetic radiation. 
It is necessary to assume that such quanta exist and provide the energy 
units of the radiation field and that energy is conserved between field and 
particle. The particle gains the amount of energy E, — E, in an upward 
transition under the influence of radiation of angular frequency w;,. The 
quantum energy of this radiation is hoz, = E, — E,, so that it is reason- 
able to associate with the upward transition of the particle the absorption 
of one quantum from the radiation field. 

In similar fashion the downward transition is associated with the 
emission of one quantum whose energy corresponds to the frequency of 
the radiation field. In accordance with Eq. (44.18), the emission prob- 
ability is proportional to the intensity of the radiation present. This 
process is therefore referred to as induced emission. 

It is sometimes convenient to rewrite Eq. (44.18) in terms of the 
reverse transition to that which appears in (44.17). Equation (44.17) 
describes the transition from an initial lower state n to a final upper state 
k; (44.18) can be made to describe the transition from an initial upper 
state k to a final lower state n, if n is replaced by k and k’ by n. Then 
(44.18) becomes 

Ane? 


—s—— I (win) 


Mm corn? 


/ ux exp (—ik-r) Va uz, dr f (44.19) 


We can now show that the integral in (44.19) is just minus the complex 
conjugate of the integral in (44.17). By means of a partial integration, 
or with the help of Eq. (23.21), the integral in (44.19) is seen to be 
equal to! 


— Juz Aa [ux exp (—ik- r)] dr 
Since only the component of the gradient along the polarization vector Ao 


appears, and this direction is perpendicular to the propagation vector k, 
the operator Va does not affect exp (—7zk-r). Thus the integral in 


‘In using Eq. (23.21), it must be remembered that the operator tV, not the operator 
V, is hermitian. 
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(44.19) is equal to 
— fu, exp (—ik +r) V4 ux d?r 


and the square of its magnitude is equal to the square of the magnitude 
of the integral that appears in (44.17). 

Since (44.17) and (44.19) are the same, the probabilities of reverse 
transitions between any pair of states under the influence of the same 
radiation field are equal. 


ELECTRIC DIPOLE TRANSITIONS 


In most cases of practical interest, the wavelength of the radiation is 
many times greater than the linear dimensions of the wave functions that 
describe the motion of the particle. This means that the quantity k-r 
that appears in the exponential in the integral of (44.17) is small in com- 
parison with unity wherever u, and u, are large enough to give an appreci- 
able contribution to the integral. A good approximation is then obtained 
by replacing exp (tk-r) by 1. 

The resulting integral can be simplified by expressing it as a matrix 
element of the momentum of the particle 


Al Ut VA Un Br = 7 / ut paun Br = 5 elpaln) 


where pa is the component of the particle momentum p along the direc- 
tion of polarization of the incident radiation. The matrix theory of Sec. 
24 shows that the momentum matrix of the unperturbed particle is given 
by p = m(dr/dt). Thus from Eq. (24.16) 


1 d 
a (k|p|n) = ai (k|r|n) = twen(k|r|n) 
The integral in Eq. (44.17) becomes, in this approximation, 
i ux Va Un Br = — . Wkn / Uurrau, dr (44.20) 


where 4 is the component of r along the direction of polarization. Equa- 
tion (44.20) can, of course, also be derived without recourse to matrix 
methods (see Prob. 5). 

Transitions for which the probability can be computed by substitu- 
tion of (44.20) into (44.17) are called electric dipole transitions, since only 
the matrix element of the electric dipole moment er of the particle is 
involved.! The transition probabilities per unit time for absorption and 


1 The quantity er is the electric dipole moment of the particle of charge e with respect 
to an arbitrarily placed origin; the addition of a constant vector (corresponding 
to a shift in the origin) to r does not affect the matrix element (44.20), since ux and un 
are orthogonal. 
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induced emission then become, in the dipole approximation, 


2p2 
= T (en) |(k|ra|n)|? (44.21) 


It is convenient to denote by (k|r|n) the vector whose cartesian compo- 
nents are the kn matrix elements of x, y, and z and to put 


Kklr[n)|? = (kle|n) - (k|r|n)* (44.22) 


which is the sealar product of (k|r|/n) and its complex conjugate. The 
reason for doing this is that there are usually pairs of states k and n for 
which |(k|r|n)|? is the same but for which the vector (k|r|n) has various 
orientations in space.!_ Then, if © is the angle between (k|r|n) and the 
direction of polarization of the incident radiation, |(k|r|n)|? cos? © can be 
substituted for |(k|r4|n)|? in Eq. (44.21), and an average performed over 0. 
The average of (44.21) for such pairs of states is then 


ae T (wen) [dele in) |? (44.23) 


FORBIDDEN TRANSITIONS 


_It may happen that the dipole matrix element (k|r|n) is zero for particular 
states k and n. In that case the approximate replacement of exp (ik - r) 
by 1 in the integral of (44.17) is not justified. The exponential can be 
expanded in a power series 


exp (ik-1) = 1+ dker +3 (ik-n? ++. 


or in a series of spherical harmonies like (19.9) 
exp (2k +r) = jo(kr) + 32j:(kr)Pi(cos 6) — 5j2(kr)P2(cos 6) + - - - 


where 6 is the angle between k and r. The second series is more con- 
venient than the first if, as is usually the case, the wave functions u, and 
Un can be expressed in terms of spherical harmonics. 

With either series, the dominant factor in the nth term is propor- 
tional to (kr)” if kr < 1 [see the first of Eqs. (15.7)]. Thus if the dipole 
matrix element vanishes but the next term of each series does not, the 
transition matrix element is reduced by a factor that has the order of 
magnitude ka, where the linear dimensions of the particle wave functions 
are of order a. A transition of this type is called a forbidden transition, 


1 For example, if the particle moves in a spherically symmetric potential V(r), the 
state k can have / = 0, and the states n can have / = 1 and three values (0, +1) for 
the magnetic quantum number m. 
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since its probability is reduced by a factor (ka)? with respect to dipole 
or allowed. transitions, and usually ka<«<1. Successive terms in the 
series can be interpreted in terms of electric dipole, quadrupole, etc., 
transitions and involve successively higher powers of ka. 

If both the states u, and u, are spherically symmetric, the integral 
fux exp (ik-r) Va u, d’r is identically zero. This can be seen by choos- 
ing cartesian coordinates for performing the integration such that the z 
axis is along the direction of polarization. Then V4 u, is an odd function 
of x, whereas u; is an even function of x, and exp (7k +r) = e**t#.2) is also 
even in z, since the vector k is perpendicular to the direction of polariza- 
tion and hence is in the yz plane. Thus the integrand is an odd function 
of x, and the integral in (44.17) vanishes. The transition between these 
states is said to be strictly forbidden, since the first-order probability given 
by (44.17) is zero. It is still possible for transitions to be produced by 
higher orders of the perturbation H’ given in Eq. (44.13); in such a caleu- 
lation, the previously neglected term e?A?/2mc? must be included in H’. 
However, it can be shown with the help of quantum electrodynamics that 
such higher-order transitions involve more than one quantum and hence 
are not simple emission or absorption processes in which the quantum 
energy is equal to the energy difference between the unperturbed states & 
and n of the particle. 


45,ISPONTANEOUS EMISSION 


A classical charged oscillator can absorb energy from a radiation field or 
give up energy to it, depending on the phase relation between the field 
and the oscillator. These effects are analogous to the absorption and 
induced emission of the preceding section. A classical oscillator also 
emits radiation spontaneously, whether or not an external radiation field 
is present. In this section, we calculate the electromagnetic radiation 
from a classical oscillating charge-current distribution in the absence of 
external fields and somewhat arbitrarily rewrite the formulas in terms of 
quantum matrix elements to obtain a probability for spontaneous emission. 
The results are then verified by comparison with Planck’s expression for 
the distribution of thermal radiation in a cavity. 


CLASSICAL RADIATION FIELD 


A charge-current distribution can be completely specified by the current 
density J, since J determines the charge density p through the equation of 
continuity (44.3). In similar fashion, the electromagnetic fields in empty 
space, away from charges and currents, can be completely specified by 
either E or H, because of the connections (44.2) between them. A wave 
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equation for H is readily obtained by taking the curl of the first of 
Eqs. (44.9). 
1 3? 4nr 


VH — 
Thus H can be obtained in terms of J alone, whereas the similar equation 
for E involves both J and p (although p can, of course, be eliminated). 
We proceed by solving Eq. (45.1) for H when the three cartesian compo- 
nents of J vary harmonically in time with the same angular frequency w 
but not necessarily with the same phase: 


J,(r,t) = 2|J.(r)| cos (wt — nz) = J2(r)e™* + ee. 


45.2 
J(r) = |J.(r) |e - (45.2) 


with similar expressions for the y and z components. We are interested 
only in the steady-state solutions for E and H that have the same fre- 
quency w 


E,(r,t) = 2\|E.(r)| cos (wt — &,) = E,(r)e*t + e.c. 
H.(1,t) = 2|H.(r)| cos (wt — ¢,) = H(r)e“** + e.e. (45.3) 
Ex(s) = [Ex()le —— Ha(r) = |H,(e)]ett 


again with similar expressions for the y and z components. From the 
second of Eqs. (44.2), E is given in terms of H in empty space by 


E(r) = <V x H(r) (45.4) 
With substitutions from (45.2) and (45.3), Eq. (45.1) becomes 


(V? + k)H(r) = ~ y x J(x) a 


@) 
c 


(45.5) 


This is an inhomogeneous equation, the solution of which can be expressed 
in terms of a propagator like (37.19), and is 


H(r) = = al er s exp (¢k|r — r’|) d3r’ (45.6) 


Equation (45.6) is the retarded solution of (45.5); for large r, this solution 
is an outgoing wave that varies with r and ¢ like r—!e*@r-#) +. c.c., so that 
the field produced by a current element occurs at a later time and hence 
is retarded with respect to the current element. 


408 QUANTUM MECHANICS 


ASYMPTOTIC FORM 

We are interested in the energy and angular momentum carried away by 
the field. As shown below, the energy can be found from the leading 
terms in the asymptotic expression for the field at very large r, which vary 
as 1/r; the angular momentum requires in addition some terms that vary 
as 1/r?. The r-dependent part of the integrand of (45.6) can be expanded 
in powers of 1/r: 


exp (ik|r — r’|) 1 ( _ 17 cos 6 + Likr’? sin? 6 


1 + etk(r—r" cos 6) 
r 


lr — 1’ | roe 7 . 
(45.7) 


where @ is the angle between r’ andr. Substitution of (45.7) into (45.6) 
gives, together with (45.4), a complete specification of the asymptotic 
electromagnetic field through terms of order 1/r’?. 


RADIATED ENERGY 


The Poynting vector, which is the energy-flux vector, is equal to 
c 
An [E(,t) x H(z,t)] 
Vv 


From (45.3), we see that its time average P(r) over a period of the oscilla- 
tion has a typical component . 


P,(t) = < {|E2(r)| IH,(1) |[cos (wt — &) cos (wt — Sy) ltime av 
-— |E,(r)| |H2(r)|[cos (wt — &) cos (wt — fz) |timeav} 
c 


£- {IEs(2)| |Hy(a)| e08 (fe — $4) 
=| [E,(r)| |H.(r)|cos (ty a f2)] 


This and the other two components can be put in the form 


P(r) = = Re [E(r) x H*(r)] (45.8) 


where Re denotes the real part of the expression that follows. Now we 
are interested only in those terms in the energy flux that fall off as 1/r’, 
since only these correspond to radiated energy; we therefore require only 
the terms of order 1/r in E and H. 

It is convenient in writing explicit expressions for the fields to choose 
cartesian axes such that the z axis is along the vector r, which goes from 
the center of the charge-current distribution to the point at which the 
field is measured. Equations (45.4), (45.6), and (45.7) then give, to 


SEMICLASSICAL TREATMENT OF RADIATION 409 


order 1/r (where now r = 2), 
_ 1h ike 1) o—ike! 3p! 
H: — moe [ tee d?r’ 


tk thr I\ p—tke!’ 3y! 
Hy? e [ s-@¢ ar 
H. — 0 (45.9) 


tk tkr !\ g—ikz’ pJ8y! 
Eee f tee d®r 


tk. ~ 
Eyer f J,(2')e—* dr’ 
E. 0 


Partial integration has been used to get rid of the derivatives of J in the 
integrand of H. Equations (45.9) show that the asymptotic fields are 
transverse to the direction of propagation. They also relate the polariza- 
tion of the emitted radiation to the current distribution and show that 
only the component of the current perpendicular to the direction of 
propagation contributes to the radiated energy. Substitution into 
(45.8) gives 


Pak | f Tye" dy! ’) (45.10) 
Equation (45.10) can be generalized to give the average energy flux in 
the direction of a vector k: 

k? 


2Qrr7ec 


2 


(45.11) 


i Jix(t’) exp (—ak +r’) dr’ 


where J ,, is the component of J perpendicular to k. 


DIPOLE RADIATION 


Equation (45.11) is an exact expression for the energy radiated by the 
classical current distribution (45.2). As in Sec. 44, the electric dipole 
approximation is obtained in the long-wavelength limit by assuming that 
kr’ <1 and replacing exp (—7k-r’) by 1 in the integrand. The energy 
flux is then 

k? 


2Qrr°c 


il Jt’) dr’ ’ (45.12) 


From Eqs. (45.9) with the same approximation, it is apparent that the 
polarization (direction of the electric field) of the radiation is determined 
by the total current vector Jo = SJ(’) d*r’. The radiation is linearly 
polarized if Jo has only one component in the plane perpendicular to the 
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direction of propagation, circularly polarized if Jy has two equal com- 
ponents in this plane that are perpendicular to each other and 90° out of 
phase (so that one component is 7 times the other), etc. 

If Jo has only one component, the angular distribution of the radia~- 
tion can be found by replacing |fJ,,(r’) d?r’|? in (45.12) by 


(Jo Jo) sin? @ = |Jo|? sin? 6 


where 0 is the angle between Jo and k, and |J,|? is an abbreviation for the 
scalar product of Jo and its complex conjugate. The total power radiated 
is then the integral of (45.12) over the surface of a sphere of radius r: 


4k? 
Be (el (45.13) 


Equation (45.13) is also valid if Jo has more than one component, and 
these do not necessarily have the same phase (see Prob. 11). 


ANGULAR MOMENTUM 


The angular momentum radiated per unit time is equal to the torque 
exerted on a large, perfectly absorbing sphere that is centered at the 
charge-current distribution that constitutes the source of radiation. The 
average energy flux is P, so that the (directed) energy density is (1/c)P 
and the momentum density is (1/c?)P. Since the radiation travels out- 
ward with speed c, the torque exerted on a perfectly absorbing differential 
element of area, dA, that is perpendicular to r is ce dA times the vector 
product of r and the momentum density: (dA/c)(r x P). Integration of 
this quantity over the sphere of radius r gives the angular momentum 
radiated by the source per unit time. Thus only the components of P 
tangential to the sphere are involved: In the notation of Eqs. (45.9), 
these are P, and P,, since the z axis is along r. 

If E, and H, were zero, the tangential components P, and P, would 
also be zero, and no angular momentum would be radiated. The third 
and sixth of Eqs. (45.9) imply only that the z (radial) components of the 
field are of smaller order than 1/r; actually they are of order 1/r?. This 
means that P, and P, fall off as 1/r? for larger. Thensincer x P appears 
in the expression for the angular momentum, and the area of the absorb- 
ing sphere is proportional to r?, the total angular momentum absorbed by 
a large sphere is independent of r. 

We require the 1/r? terms in E, and H, but not in the other field 
components. 


H, + et f fy Fela!) — 2! Ty(e")le™ dr! 


(45.14) 
Ex oa ot f [2Fe(a!) + she Fele!) + thy! Fy(e’)le-™ dir! 
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Equations (45.9) and (45.14) are sufficient for an exact calculation of the 
radiated angular momentum.’ 


DIPOLE CASE 


The expressions for P, and P, are simplified by the dipole approximation 
since only the terms of lowest order in kr’ need be retained in (45.9) and 
(45.14). It is easy to see that the leading term in P,, for example, is 
—(c/2n) Re (EzH*) and not (c/2r) Re (E,H;). We obtain to lowest 


order in kr’ 
ne *_ Re (i [ er [ sta’) 


P, = is Re G if J, dir’ / Jar") 


Equations (45.15) refer to axes fixed with respect to the element dA 
of absorbing area at r. They must now be rewritten in terms of general 
cartesian coordinates, in order that the angular momentum component 
about a particular axis fixed in space can be found. This is analogous to 
the rewriting of the energy-flux expression (45.10) in the general form 
(45.11) but is somewhat more complicated. To accomplish this, we 
choose new cartesian coordinates x’, y’, 2’ that are fixed in space. With 
respect to these, the orientation of the old axes depends on r in the follow- 
ing way (see Fig. 38): the z axis is in the direction of r and has the polar 
angles 6, @ with respect to the new axes, the y axis is perpendicular to r 
and in the plane of r and z’, and the z axis is perpendicular to the plane of 
randz’. Ifnow we wish to calculate the contribution to the z’ component 
of angular momentum from absorption by the element of area 


dA =r’ sin @d0dq¢ 


Pz 
(45.15) 


at r, we need only P, given in (45.15). This can be written in terms of 
the components of the total current vector Jo along the new axes as 
P, = oo Re [i(Joz’ sin 8 cos @ + Joy sin 6 sin @ + Jo.’ cos 6) 
3 
(Jx,, cos @ — Jo, sin ¢)] (45.16) 
The moment arm about the z’ axis associated with P, is r sin 6, so that 
the differential element of angular momentum component is 


AS * sin OP sin 6 d0-dé (45.17) 


1 For a more general discussion see W. K. H. Panofsky and M. Phillips, “Classical 
Electricity and Magnetism,” 2d ed., sec. 14-10 (Addison-Wesley, Reading, Mass., 
1962); J. D. Jackson, ‘‘Classical Electrodynamics,” sec. 16.3 (Wiley, New York, 1962). 
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x 


Fig. 38 The relation between the unprimed coordinate sys- 
tem of Eqs. (45.15) and the primed coordinate system of Eq. 
(45.16). 


Substitution of (45.16) into (45.17) and integration over the polar angles 
give 


Le = ZF JoeSby — Jor Tie) (45.18) 


It is apparent from Eq. (45.18) that the radiation of a particular 
component of angular momentum depends only on the particular compo- 
nents of Jo. Moreover, there must be two such perpendicular components 
that are out of phase with each other, for if Jo, and Joy are both real or 
have the same phase, the parenthesis in (45.18) is zero. Thus a linear 
dipole (Jo entirely in one direction) radiates no angular momentum. The 
maximum angular momentum for a given value of |Jo|? is radiated when Jo 
has two equal perpendicular components that are 90° out of phase with 
each other and the third perpendicular component is zero. If the non- 
vanishing components are along x’ and y’ we can put Joy = Wo; Eq. 
(45.18) then becomes 


_ 8k 
~ 3c? 


and the other two components of L are zero. 


Ly |Joz|? = = lJo|? (45.19) 
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Comparison of Eqs. (45.13) and (45.19) shows that the maximum 
angular momentum radiated per unit time by an oscillating electric 
dipole is 1/kc = 1/w times the energy radiated per unit time. If this 
relation is taken over into quantum theory, it shows that a quantum of 
energy, iw, radiated by an electric dipole carries with it an amount of 
angular momentum that does not exceed h. 


CONVERSION FROM CLASSICAL TO QUANTUM THEORY 


We now convert to quantum theory the classical expression (45.13) for 
the power radiated by an electric dipole. This requires that we find a 
quantum analog for the total current vector Jo and associate the radiated 
power with a transition probability between states of the particle that is 
doing the radiating. 

We want to replace J by a current density that is associated with an 
initial upper state wu, and a final lower state un, since energy is radiated 
during the transition from k to n._ It is natural to represent the current 
density as the product of a charge density and a velocity and to take for 
the velocity the momentum operator divided by the mass: — (th/m)vV. 
The charge density for a stationary state is expected to be the charge of 
the particle times its position probability density: e|y|?.. However, we are 
concerned here with a transition between states, and so we replace this by 
euxu,.1 The way in which the V that appears in the velocity operates 
on the wave functions that appear in the change density is determined by 
arguments of the type presented in Sec. 7 [see Eq. (7.3)]._ We thus arrive 
at a quantity to substitute for the classical current density: 


Ja) > — ax) vue) (45.20) 


We assume that (45.20) can be substituted into all the foregoing classical 
expressions to give quantum results.? 

Integration of (45.20) over the coordinates gives the total current 
vector 


Jo= 


teh F : 
— — | ukVu, dr = —tewn f usrur Br = tewin(k|r|n)* 


(45.21) 


with the help of Eq. (44.20). Substitution of (45.21) into (45.13) then 
gives the radiated power. We interpret this power as the product of the 


1 The use of y rather than the uz has been discussed by F. E. Low, Am. J. Phys. 29, 
298 (1961). , 

2 The exponential that appears in (45.11), for example, can be placed either before or 
after the V operator, since only the component of V that is perpendicular to k enters 
in. 
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spontaneous rate of transition from k to n and the quantum energy 
horn = E, — E, given off in each transition. The transition probability 
per unit time for spontaneous emission then becomes 


4e 2k Win 
3hc 


4e2 ne 
[de lefn)|? = (dele |n|? (45.22) 


where use has been made of the relation wy, = ke. 


PLANCK DISTRIBUTION FORMULA 


The transition from the classical expression (45.13) to the quantum 
expression (45.22) can claim only a moderate amount of plausibility. 
The correctness of the latter result can, however, be verified by showing 
that Planck’s formula for the spectral distribution of thermal radiation in 
a cavity follows from (45.22) and (44.23). This is the way in which the 
relation between the probabilities for absorption, induced emission, and 
spontaneous emission was first obtained.! 

We assume that the walls of the cavity contain particles of charge e 
and mass m, each of which is bound by a potential V of the type that 
appears in (44.1). When these particles are in equilibrium with thermal 
radiation at the absolute temperature 7, there must be as many quanta 
of each frequency emitted as absorbed per unit time. The rate of 
emission of quanta of frequency wz, is the sum of (44.23) and (45.22), 
multiplied by the number of particles that are in the upper state k. The 
rate of absorption of these quanta is the product of (44.23) and the 
number of particles in the lower state n. From statistical mechanics,? 
the equilibrium ratio of the number of particles in the upper state to the 
number in the lower state is given by e~#+—¥»)/*?, where x is Boltzmann’s 
constant. We thus obtain, on dropping the subscripts from wen, 


ene [eT ES T(w) |(k|r|n) |? + ee Keng | = oe I(w)|(k|r|n)|? 


This is readily solved to give 7(w), or equivalently 


I(w) Aw _ fiw? Aw 


c ~ 7 8¢3 (ehwlkT — J) (45.23) 


Energy density = 
It is interesting to note that the parameters e, m, and (k|r|n) of the particle 
that emits and absorbs the radiation drop out of the expression for I(w). 


1 A. Einstein, Phys. Z. 18, 121 (1917). Einstein’s A coefficient is just (45.22), and his 
B coefficient is (44.23) divided by the energy density of radiation I(win)/c (this is 
expressed in terms of the angular frequency o,, rather than the circular frequency 
win/2r). 

2 See, for example, K. Huang, ‘‘Statistical Mechanics,” sec. 9.3 (Wiley, New York, 
1963). 
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The agreement between Eq. (45.23) and the Planck distribution formula 
provides a verification of the ratio of (44.23) to (45.22) and hence shows 
that the latter expression is correct if the former is. 


LINE BREADTH 


A classical oscillator that radiates electromagnetic waves loses energy, so 
that the amplitude of its oscillation decreases in time. Thus the electro- 
magnetic fields given off by it have a damped sinusoidal time dependence: 
e~*% cos (wot + a). The Fourier analysis of these fields gives the fre- 
quency spectrum of the radiation from the oscillator. The radiated 
intensity per unit frequency range at the angular frequency w is propor- 
tional to 


1 
(=o)? +a? ee 
According to (45.24), the intensity of the emitted spectral line has half 
its maximum value when w = wo + #y. The quantity y is called the 
natural line breadth and in cases of practical interest is small in com- 
parison with wo. 

The line breadth is evidently twice the initial fractional rate of 
decrease of the amplitude of the classical oscillator or is equal to the 
initial fractional rate of decrease of the oscillator’s energy. It is plausible 
to associate the rate of decrease of the energy of the classical oscillator 
with the rate of decrease of the probability of finding the corresponding 
quantum system in its initial upper state. If this is done, the quantum 
analog of the classical natural line breadth y is the initial transition 
probability per unit time for spontaneous emission given in (45.22). 
Although this result is correct if the final state is the ground state, there 
will be an additional broadening of the line if the final state has a finite 
breadth.? 

The foregoing relation between transition probability and line 
breadth can be arrived at in a qualitative but more general way by means 
of the uncertainty relation (3.3). The reciprocal of the transition proba- 
bility per unit time is of the order of magnitude of the time that the 
quantum system stays in its upper state. Thus a determination of the 
energy of the upper state cannot occupy a time that is of larger order of 
magnitude than the lifetime 1/7 of this state. According to (3.3), this 
means that the energy cannot be determined with an accuracy much 
greater than h divided by the lifetime, or Ay. If the energy of the upper 
state is uncertain by this amount, the frequency of the emitted line will be 
uncertain (broadened) by y. In general, a quantum energy level is 


1 For further discussion of line breadth, see Heitler, op. cit., sec. 18. 
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broadened by any process that shortens its lifetime; the level is perfectly 
sharp only if the lifetime of the state is infinite (true stationary energy 
eigenfunction). 

A qualitative idea of the natural breadth of electric dipole lines 
emitted by a quantum system can be obtained by rewriting the expres- 
sion (45.22) for y in the form 


2 
= FF beKkleln) 2 
The factor e?/hce is a dimensionless constant that is very nearly equal to 
t#r if e is the electronic charge,! and the factor k?|(k|r|n)|? has already been 
assumed small in comparison with unity in arriving at the dipole approxi- 
mation. Thus the ratio of line breadth to angular frequency is expected 
to be quite small (it is of the order of 10-6 for typical atomic dipole lines). 


46LISOME APPLICATIONS OF RADIATION THEORY 


The semiclassical radiation theory developed earlier in this chapter is 
applied in this section first to the determination of the conditions for 
allowed transitions and then to the theory of the photoelectric effect. 


SELECTION RULES FOR A.SINGLE PARTICLE 


The discussion of forbidden transitions at the end of Sec. 44 shows that 
the probabilities for absorption and induced emission are reduced by a 
factor of at least (ka)? with respect to allowed transitions if the dipole 
matrix element (k|r|n) vanishes. The same remarks apply to the proba- 
bility for spontaneous emission, since the integral in (45.11) is the same 
as that in (44.19) when the substitution (45.20) is made for J. 

The conditions on u, and u, for which the dipole matrix element is 
different from zero constitute the selection rules. They are easily formu- 
lated if the potential V that appears in the unperturbed hamiltonian of 
(44.13) is spherically symmetric. It is shown in Sec. 14 that the energy 
eigenfunctions can then be written as products of functions of the radical 
distance r and spherical harmonics Y;,,(0,¢) defined in Eq. (14.16). The 
matrix element (k|r|n) is the vector whose cartesian components are the 
corresponding matrix elements of z, y, and z. The matrix element of 
zis fur cos 0u, d*r, which can be written as a product of an integral over 
r and the angular integral 


JF " Y%(6.9) 008 6 Yyq(0,4) sin 6 d0 de (46.1) 


1 This is the fine structure constant that appears in the theory of the fine structure 
of atomic energy levels (see Chap. 13). 
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where the primed and unprimed subscripts are the angular momen- 
tum quantum numbers for the lower state u, and the upper state uz, 
respectively. 


The ¢ integration in (46.1) is i: 7 gilm'—m)o d@, which is zero unless 
/ 


m' =m. The integration can then be written, apart from numerical 
factors, 


es wPy"(w)Py™(w) dw w = cos 6 (46.2) 


Now it can be shown with the help of the generating function (14.13) for 
the associated Legendre functions that 


1+ |ml 1 — |m| +1 
2i+ 1 2i+1 


Substitution of this into (46.2) shows, with the help of the orthogonality 
integral (14.15), that the matrix element of z vanishes unless m’ = m 
and il’ =1+1. A similar treatment shows that the matrix element of 
x + iy vanishes unless m’ = m — 1 and l’ = 1 + 1, and the matrix ele- 
ment of x — iy vanishes unless m’ = m+ 1 and /’=1+1. These 
selection rules also follow from the discussion at the end of Sec. 28 and 
determine the possible allowed (electric dipole) transitions of a single 
charged particle that moves in a central force field. 


wPi"(w) = 7” 1(w) + Paw) 


POLARIZATION OF EMITTED RADIATION 


The discussion following Eq. (45.12) shows that the polarization of the 
emitted radiation is determined by the total current vector Jo and hence 
[because of (45.21)] by the dipole matrix element. When the initial and 
final states have / values that differ by unity and the same magnetic 
quantum number m with respect to the z axis, only the matrix element of 
z fails to vanish. The radiation is then linearly polarized along the z axis 
if viewed in the xy plane, and there is no radiation along the z axis. When 
the magnetic quantum numbers of the initial and final states differ by 
unity, the x and y components of the dipole matrix element are 90° out 
of phase, and the z component vanishes; the radiation is then circularly 
polarized if viewed along the z axis and is linearly polarized perpendicular 
to the z axis if viewed in the zy plane. These results are of interest in 
connection with the polarization of the radiation from atoms placed in 
a magnetic field (see the discussion of the Zeeman effect in Sec. 48). 


CONSERVATION OF ANGULAR MOMENTUM 


The discussion of Eqs. (45.18) and (45.19) shows that the angular momen- 
tum that is carried away by an emitted quantum has its makimum value 
h and is directed along the z axis, when Jo, = tJoz. From (45.21) we see 
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that this is the case in which (kly|n)* = i(k|z|n)*, or (kly|n) = —tUklx|n). 
Now zx = rsin 6 cos ¢ = 4rsin 0: (e* + e-*), and y =rsin 6sin¢ = 
— gir sin 6(e* — e~*), Then it is apparent from the ¢@ integration in 
(46.1) that, in order for the matrix element of y to equal —i times the 
matrix element of x, the magnetic quantum number of the initial state u; 
must be greater than the magnetic quantum number of the final state u, 
by unity. Application of Eq. (14.23) shows that the z component of 
angular momentum of the particle has decreased by f during the transi- 
tion. Thus angular momentum is conserved between the radiating par- 
ticle and the emitted quantum. 

The foregoing result is based on the connection between the classical 
and quantum current densities assumed in (45.20). The successful 
derivation of the Planck distribution law in Sec. 45 shows that (45.20) 
is correct so far as magnitude is concerned. The above demonstration 
of the conservation of angular momentum shows in addition that the 
phases of the initial and final states are inserted properly in (45.20) ; if, 
for example, J had been assumed proportional to u*Vun, an inconsistent 
result would have been obtained in the preceding paragraph. 

If the magnetic quantum number does not change in a transition, 
only the matrix element of z fails to vanish, and the discussion of (45.18) 
shows that the quantum carries off no angular momentum. This might 
seem at first to be in contradiction with the change by one unit of the 
orbital angular momentum quantum number l. The x and y components 
of the angular momentum of the particle do not commute with the z 
component (which in this case is known to be mh and does not change), 
so that they cannot be precisely specified. Their expectation or average 
values for states that are described by quantum numbers J and m are 
zero, since the diagonal elements of the matrices for the x and y com- 
ponents of the angular momentum are all zero [see Eqs. (27.26)]. Thus 
there is no observable change in any of the components of the particle’s 
angular momentum, and so the expectation value of the angular momen- 
tum carried off by the quantum should be zero. The x and y angular- 
momentum components for a particle in a stationary state can be thought 
of as fluctuating about zero in such a way that their average values are 
zero, although their average squares are not. The change in 1 corresponds 
to changes in these average squares. 


SELECTION RULES FOR MANY-PARTICLE SYSTEMS 


When a quantum-mechanical system consists of several particles that do 
not interact with each other, the total hamiltonian is simply a sum of 
terms like Hy) + H’ that appear in (44.13). The unperturbed energy 
eigenfunctions are products of single-particle eigenfunctions such as are 
discussed in Sec. 40 (they can be unsymmetrized if the particles are not 
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identical). It is clear that the matrix element that appears in the first- 
order perturbation theory of Sec. 44 (absorption and induced emission) 
involves a multiple integral of the form 


fo+ + SusG)ug(2) + + + [H') + AH’) 
+ SS ]wa(1) us (2) oF PT dr, dre ore 


Because of the orthogonality of different u’s for the same particle, this 
integral vanishes unless all the single-particle functions wa, ... are 
equal to the corresponding functions wa, ... , except for one. Thus 
only one of the particles can change its state in a transition, and the 
selection rules for a central force field are precisely those given above. 
Since the spontaneous transition probability can be related to the same 
integral through the Planck distribution formula, these selection rules 
hold for spontaneous emission as well as for absorption and induced 
emission. 

If the system consists of several charged particles whose mutual 
interactions cannot be neglected, we must base the selection rules on 
general conservation laws for total angular momentum and parity. It is 
not difficult to generalize the work of Secs. 44 and 45 to show that the 
dominant term, when the wavelength of the radiation is large in com- 
parison with the dimensions of the system, is the matrix element of the 
total electric dipole moment e:7; + eof. + --° . The angular momen- 
tum radiated by an oscillating dipole cannot, according to the discussion 
of (45.19), exceed & per quantum; this classical argument is based on an 
arbitrary current distribution, so that it is not invalidated if several 
particles contribute to the dipole. The interpretation of the conservation 
of angular momentum between the emitted quantum and the radiating 
system is complicated by the semiclassical nature of the entire treatment. 
A consistent treatment based on quantum electrodynamics shows that 
the correct selection rule based on this conservation law is that the 
total angular momentum quantum number of the system can remain 
unchanged or can increase or decrease by unity. An exceptional case is 
that in which this quantum number is zero for both initial and final states. 
The initial and final wave functions are then spherically symmetric, and 
an extension of the argument given at the end of Sec. 44 shows that a 
radiative transition between these states is not only forbidden (no 
allowed dipole transition) but strictly forbidden (no first-order transition 
whatever). 

The discussion of parity in Sec. 29 is easily extended to show that 
the parity of each energy eigenfunction can be well defined (even or odd) 
if the total hamiltonian is unchanged by reflection of the coordinates of 
all particles through the origin. The electric dipole moment operator 
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given above is evidently odd with respect to reflection of all coordinates; 
thus its matrix elements vanish in this case unless the initial and final 
states have opposite parities. This selection rule is known as the Laporte 
rule. 


PHOTOELECTRIC EFFECT 


When a bound system that contains charged particles is irradiated by 
sufficiently high-energy quanta, there is a finite probability that the 
system will be broken up. This process is usually called the photoelectric 
effect in the case of atoms and photodisintegration in the case of nuclei. 
As an example, we consider the ejection of an electron from an atom by 
a photon of energy hw > «, where —e is the ground-state energy of the 
electron. The initial wave function of the electron is uo(r), and in its 
final state it has the kinetic energy 

Be = hw —¢ (46.3) 
We suppose the radiation to be incident along the positive z axis and to 
be polarized with its electric vector along the x axis. Then the matrix 
element for the transition is given by the second of Eqs. (44.14) 

(k|H’°|0) = seh f upewzlAy (3) d'r (46.4) 
We assume that the final state can be represented to sufficient accuracy 
by the plane wave u,(r) = L-* exp (ik-r). This is equivalent to the 
assumption that the Born approximation is valid for the scattering of the 
electron by the remaining ion. 

With the help of a partial integration, Eq. (46.4) becomes 


(k|H’}0) = — CHA oles Uo exp E (% —k- r)| d’r 


mcLi 


From Kgs. (35.14) and (35.18), the transition probability per unit time 
from the bound to the ionized state is 


mkL* ; 
w= Te \(k|H’°|0)|? sin 6 dé d¢ 
It is convenient for what follows to introduce the momentum AK that is 
transferred to the atom: 


K="1,—k (46.5) 


where 1, is a unit vector in the z direction. The differential cross section 
for the photoelectric effect is equal to w divided by the incident flux of 
photons. This flux is obtained by dividing the incident intensity (44.12) 
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by fw, so that 
e*kk,? 
2rmcw 


o(6,¢) sin 6d0d¢ = 


i: Uo exp (iK - r) dr |’ sin 0 d0 do 


(46.6) 
ANGULAR DISTRIBUTION 


There are two factors in Eq. (46.6) that determine the angular distribu- 
tion of the ejected photoelectrons. One of these is the factor k,?, which 
shows that the electrons tend to have a cosine-squared distribution with 
respect to the polarization vector of the incident radiation. If the radia- 
tion is unpolarized, k,? must be replaced by $(k.? + k,?), so that there is 
a sine-squared distribution with respect to the direction of incidence. 
In either case, the electrons are ejected preferentially at right angles to 
the incident beam of photons. 

The appearance of the momentum transfer vector K in the integrand 
of (46.6) also affects the angular distribution. The discussion of Eq. 
(38.29) shows that integrals of the kind that appear in Eq. (46.6) generally 
decrease as K increases. Now K is smallest when k is in the z direction, 
so that the K dependence of (46.6) tends to shift the maximum in the 
differential cross section toward the forward direction. However, this 
effect is appreciable only when k and w/c are comparable in magnitude. 
If we assume for the moment that « can be neglected in Eq. (46.3), the 
quantity w/ck ~ hk/2mc = v/2c, where v is the speed of the ejected 
electron. Thus the forward shift of the cross-section maximum occurs 
for high-energy photons and ejected electrons, in which case ¢ can in 
fact be neglected as assumed above.! 


CROSS SECTION FOR THE ATOMIC PHOTOELECTRIC EFFECT 


We now specialize to the situation in which the photoelectron is ejected 
from the lowest state (K or 1s shell) of an atom. The initial wave 
function u(r) is then the wave function w190(7,9,¢) of Eq. (16.24) and is 
given by 

ao h? 


uo(r) = (wat)-tere a = 


Z (46.7) 


So long as wo is spherically symmetric, the angle integration in (46.6) is 
easily performed and gives 


Sre*kk,? 
mewK? 


o(0,¢) = 


f i uo(r) sin Kr r dr ; (46.8) 


1 The quantities & and w/c can also be comparable very close to the photoelectric 
threshold, when ww is only slightly greater than e. However, the Born approximation 
is not valid in this case. 
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Substitution of (46.7) info (46.8) yields 


2773 2 
0(0,8) = Ri (46.9) 
The discussion of Eq. (38.4) shows that the Born approximation is 
most nearly valid for high energy and for Ze?/hvu <K 1. Nowe = Z%e?/2ao, 
so that h?k?/2me = (hv/Ze?)?. Thus e can be neglected in Eq. (46.3). 
Since, aS was shown above, w/ck ~ v/2c in this case, the magnitude of 
K given by Eq. (46.5) is approximately k(1 — v cos @/2c). Further, 
ka = hv/Ze? > 1. Thus the factor 1 + K?a? in the denominator of Eq. 
(46.9) can be replaced approximately by k?a?(1 — v cos 6/c). We then 


obtain for the high-energy photoelectric differential cross section 


32e? 


o(9,¢) = mew(ka)® ® 


in? 6 cos? @ (1 + * cos s) (46.10) 
Since the electron has been treated nonrelativistically, v/c must be fairly 
small in comparison with unity, and therefore terms of order v?/c? have 
been neglected in Eq. (46.10). Integration over the angles yields the 
total cross section 


— 1287 e? 
°* 3 mca(ka)® 


It follows from (46.3) and (46.7) that o is proportional to Z*/(hw)!. 
In using Eq. (46.11), it must be remembered that o is the total cross 
section for each of the K electrons and hence must be doubled to obtain 
the total atomic cross section for the K-shell photoelectric effect. 

It is interesting to note that the leading term of Eq. (46.10), in 
which v/c is neglected in comparison with unity, and all of Eq. (46.11) 
result from the electric dipole approximation discussed in Sec. 44. In 
this approximation, e*/¢ is replaced by unity in Eq. (46.4). 


(46.11) 


IMPROVEMENT ON THE BORN APPROXIMATION 


There are two respects in which the foregoing calculation is based on 
first-order perturbation theory. First, the matrix element (46.4) is 
regarded as small, so that the interaction between the electron and the 
electromagnetic field is treated to first order. Second, the electron wave 
function is taken to be a plane wave in the final state, so that the effect 
of the ionic potential on this state is neglected. It is very difficult to 
improve on the calculation from the first point of view, and it is hardly 
worthwhile because of the smallness of the electron-radiation interaction. 
On the other hand, an improvement with regard to the second point is 
feasible, and some trouble in this regard is justified since the results 
would then be applicable to low energies and to large values of Z; that 
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is, Ze?/hv would not then have to be small in comparison with unity. 
This can be accomplished by making use of the distorted wave Born 
approximation that was discussed in Sec. 38. 


PROBLEMS 


1. Show that, if V-J = p = 0, the most general solution of Maxwell’s equations 
can be expressed in terms of potentials such that V- A = ¢ = 0. 


2. Show that the probability density associated with Eq. (44.1) is given by (7.1) 
and that the probability current density must be changed from (7.3). 


3. Show that, if the gauge transformation (44.5) is accompanied by the transforma- 
tion y’ = yet*x/*, the form of the wave equation (44.1) is unaffected. 

4. Estimate the order of magnitude of eA/cp, when e is the electronic charge, A is 
the magnitude of the vector potential for the visible part of the spectrum that cor- 
responds to the radiation in a cavity at several thousand degrees centigrade, and p is 
the momentum of an electron in the first excited state of hydrogen. 

5. Verify Eq. (44.20) by means of the wave equation, without recourse to matrix 
methods. 

6. Show that the transition probability for spontaneous emission is equal to the 
transition probability for induced emission that would result from an isotropic field of 
such intensity that there is one quantum per state of the field in the neighborhood of 
the transition frequency. 


7. A hydrogen atom in its first excited (2P) state is placed in a cavity. At what 
temperature of the cavity are the transition probabilities for spontaneous and induced 
emission equal? 

8. What is the spontaneous transition probability per unit time, expressed in sec™!, 
of a hydrogen atom in its first excited state? 


9. What is the selection rule for allowed transitions of a linear harmonic oscillator? 
What is the spontaneous transition probability per unit time, expressed in sec—!, of an 
oscillator in its first excited state, when e, m, and w are the same as in Prob. 8? 


10. Show that a logarithmic factor like that obtained in Prob. 14, Chap. 9, always 
appears in the cross section for excitation of a one-electron atom by electron impact, if 
the corresponding radiative transition is allowed. Derive the simplest relation that 
you can between the differential cross section for excitation by electron impact ard 
the corresponding transition probability for spontaneous emission, assuming the 
transition is allowed. , 


11. Make use of the dipole expression (45.12) for the radiated intensity to find the 
angular distribution of the radiation when Jo, = 1Joz, Jo: = 0. Show also that the 
total power radiated is still given by Eq. (45.13). 

12. Show that k-r in Eq. (44.19) is of order e?/Ac ~ zy for a hydrogen atom 
transition. 

13. Assume that the interaction between the neutron and the proton that make 
up a deuteron can be represented by a square well potential for which a = 0 (zero- 
range or 6-function interaction), and that the only bound energy level of the system 
has l = 0 and e = 2.23 Mev (million electron volts). Show that no error is involved 
in the calculation of the photodisintegration cross section if the final-state wave func- 
tion is taken to be a plane wave. Find the differential and total cross sections for 
unpolarized photons. 


12 
Atoms, Molecules, and 
Atomic Nuclei 


This chapter is not intended to be a complete survey of the properties 
of atomic systems. It is primarily a presentation of a relatively few 
problems that arise in connection with the structure of matter, selected 
because they supply interesting and instructive applications of quantum 
mechanics. These problems are grouped according to subject; enough 
explanatory material is included so that the treatment is coherent, 
although severely limited in scope. 


47_JAPPROXIMATIONS IN ATOMIC STRUCTURE! 


The ground states of the two lightest atoms, hydrogen and helium, were 
considered in Secs. 16 and 32, respectively. Variation calculations simi- 
lar to those described for helium have been carried through for others of 


1 For a more detailed discussion of the material of this section and the next, see 
E. U. Condon and G. H. Shortley, ‘The Theory of Atomic Spectra’”’ (Cambridge, 
London, and Macmillan, New York, 1935); J. C. Slater, “Quantum Theory of Atomic 
Structure,” vol. II (McGraw-Hill, New York, 1960). 
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the light atoms. This section describes some of the approximations that 
have been used for the heavier atoms. The alkali atoms are discussed 
separately in Sec. 48. 


CENTRAL-FIELD APPROXIMATION 


The starting point of calculations on all except the lightest atoms is the 
central-field approximation. This assumes that each of the atomic elec- 
trons moves in a spherically symmetric potential energy V(r) that is 
produced by the nucleus and all the other electrons. The approximation 
is a good one if the deviation from the V(r) for one electron produced by 
close passage of other electrons is relatively small. This is actually the 
case, since the constant nuclear potential is of the order of Z times as 
large as the fluctuating potential due to each nearby electron, and the 
latter varies quite slowly (inversely) with the separation distance. The 
two principal problems are then the calculation of the central field and 
the correction of the approximate results obtained from it. Before con- 
sidering these problems, we discuss some general properties of the central 
field. 

The potential energy V(r) for a neutral atom has the coulomb form 
—e?/r at a great distance r from the nucleus, since the removal of the 
electron whose potential is being measured leaves a singly charged positive 
ion. ‘The electron in the hydrogen atom, for which the potential energy 
is —e?/r at all r, was shown in Sec. 16 to have an infinite number of 
bound energy levels characterized by the quantum numbers n, lJ, and m. 
An infinite number of energy levels is also expected for V(r), since for 
large n the electron wave function is small near the nucleus, and only the 
form of V(r) for large r is significant. An important difference between 
the two situations is that the degeneracy between states of the same n 
and different J that occurs in hydrogen is.removed in a noncoulomb 
central field. This is because the electrons that have smaller angular 
momentum penetrate closer to the nucleus, and V(r) is stronger (more 
negative) than —e?/r there, since the nucleus is less completely screened 
by the other electrons. Thus, for given n, the states of lowest / have the 
lowest energy. The degeneracy with respect to m is not affected, since 
this occurs whenever the potential is spherically symmetric. 

Because of the spin, four quantum numbers n, /, m, and m, are 
required to specify the state of an electron in a central field. The orbital 
quantum numbers / and m, are the same as Zand m in the hydrogen atom, 
m, = +4 specifies the spin orientation, and n is a natural generalization 
of the total quantum number that appears in hydrogen. Equation 
(16.14) shows that n — J — 1 is the number of nodes of the radial part 
of the hydrogen wave function; this definition of n is carried over to the 
general central field, so that / does not exceed n — 1. 
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PERIODIC SYSTEM OF THE ELEMENTS 


According to the Pauli exclusion principle (see the discussion of anti- 
symmetric wave functions in Sec. 40), not more than one electron in an 
atom can have a particular set of values of the four quantum numbers 
given above. As Z increases, electrons fill the one-electron states of 
lowest energy in succession; the ground state of an atom in the central- 
field approximation is that in which there are no unfilled electron states 
that have lower energy than any that are occupied. Because of the 
degeneracy with respect to m, and m,, there can be as many as 2(2/ + 1) 
electrons with the same energy in a shell that is specified by n and I. 
It is apparent, then, that the ground-state configuration of the electrons 
in an atom can be described by specifying the number of electrons in 
each shell. In the central-field approximation, all shells that contain 
any electrons are full except perhaps that which has the highest energy. 

The chemical properties of atoms are determined for the most part 
by the least tightly bound, or valence, electrons, which are in the shell of 
highest energy. The most important factors are the number of occupied 
and unoccupied electron states in this shell, and the energy interval 
between this and the next higher (empty) shell. For example, an atom 
tends to be chemically inert if its highest shell is full and there is an 
appreciable energy gap to the next higher shell, since then electrons are 
not readily shared with other atoms to form a molecule. The quasi- 
periodic recurrence of similar highest shell structures as Z increases is 
responsible for the periodic system of the chemical elements. 

In the usual spectroscopic notation, the n value of a shell is given as 
a number, the / value as a letter, and the number of electrons in the shell 
as a numerical superscript. The letter code for 1 and the maximum 
number 2(2/ + 1) of electrons in a shell are as follows: 


f=0, 1, 2 93,. 4 8263 
oy pe. ot Te Gee ye Bes 
2(21 +1) = 2, 6, 10, 14, 18, 22,... 
For example, the ground-state configurations of sodium (Z = 11) and of 
mercury (Z = 80) are 
Na: 1s?2s?2p*3s 
Hg: 18?2s?2p°3s?3p%4s?3d !°4p%5s74d°5p%6s24f145d'0 
The ground-state configurations of many of the elements can be 
written down simply from a knowledge of the order in which the energies 


of the shells increase. This order can be inferred from spectroscopic 
evidence and is as follows: 


1s,2s,2p,38,3p,[48,3d],4p,[5,4d],5p,[6s,4f,5d],6p,[7s,5f,6d] 


\ 
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The brackets enclose shells that have so nearly the same energy that they 
are not always filled in sequence. These shell energies are close together 
because the increase in n and the decrease in | tend to compensate each 
other; thus the 4s state, which has a higher energy than the 3d state in 
hydrogen, is depressed by the penetration caused by its low angular 
momentum. The s shell in each bracket is always filled first, although it 
can lose one or both of its electrons as the other shells in the bracket fill 
up. Apart from the brackets, there are no deviations from the indicated 
order of filling. 

Table 2 gives the ground-state configurations of each of the elements. 
An atom contains all the full shells that occur above and to the left of its 
position in the table. Since the number of s electrons varies as each d 
shell fills, the d columns are subdivided to show the number of s electrons. 
The two groups of atoms that have a partially full f shell in their ground- 
state configurations fit in at * (rare earths) and at t (heaviest elements). 
The first group has the 6s shell full and the second has the 7s shell full; 
the distribution of electrons in d and f shells for each group is shown 
below the main table. 

A few of the periodicities are worth explicit mention. The first 
electron to go into each s shell beyond 1s gives an alkali, and the elements 
just before each of these (full 1s shell or a full p shell) are rare gases. The 
elements with the same number of electrons in a p shell have similar 
chemical properties; this is especially striking in the case of the halogens 
(one electron short of a full p shell). The elements with full 2s and 3s 
shells (Be and Mg) that are followed by p shells have somewhat different 
properties from the alkaline earths, which have full s shells followed by d 
orfshells. The filling of the 4s,3d shells gives elements somewhat similar 
to those arising from the filling of the 5s,4d shells. The elements that 
correspond to full bracketed shells (Zn, Cd, and Hg) are quite similar, as 
are the noble metals (Cu, Ag, and Au) in which an s electron is missing 
from the full bracketed shells. 


THOMAS-FERMI STATISTICAL MODEL 


We now turn to the first of the problems associated with the central-field 
approximation. Two methods have been used for the determination of 
the potential energy V(r). The first of these, due to Thomas! and Fermi,’ 
is discussed here, and the second, due to Hartree, is taken up later. The 
Thomas-Fermi statistical model assumes that V(r) varies slowly enough in 
an electron wavelength so that many electrons can be localized within a 
volume over which the potential changes by a small fraction of itself. 
The electrons can then be treated by statistical mechanics and obey the 


1L. H. Thomas, Proc. Cambridge Phil. Soc. 28, 542 (1927). 
2. Fermi, Z. Physik 48, 73 (1928). 
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Table 2 Ground-state electron configurations of the elements! 


1 33 


32 


Ga 
31 


: Te 


Sb 


¥ 
* 


T 5f: 


1This table is taken from Condon and Shortley, op. cit., p. 333, with recent assignments from 


stants,” pp. 198-199 (Longmans, 


not included in the table, are americium (Am, 95), curium 


, 97), californium (Cf, 98), einsteintum (Es, 99), fermium (Fm, 100), 


mendelevium (Md, 101), nobelium (No, 102), and lawrencium (Lr, 103). 


The heaviest elements, 


G. W. C. Kaye and T. H. Laby, ‘“‘Tables of Physical and Chemical Con 
(Cm, 96), berkelium (Bk 


London, 1966). 
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Fermi-Dirac statistics mentioned in Sec. 40. At normal temperatures, 
the thermal energy «7 is very small in comparison with V(r) everywhere 
except at the edge of the atom, where the chance of finding an electron is 
small. In this case, the Fermi-Dirac statistics requires that the electron 
states fill in order of increasing energy, as assumed above. The difference 
between the present treatment and the more general discussion given 
earlier in this section lies in the additional assumption that V(r) is 
_ sensibly constant over a region in which many electrons can be localized. 

The number of electron states in a cube of edge length L at the walls 
of which the wave functions obey periodic boundary conditions was 
computed in Sec. 11 to be (L/2m)* dk, dk, dk,. This must be multiplied by 
2 to take account of the two possible spin states; then the number of 
states for which the magnitude of the momentum p = hk is less than or 
equal to po is 


L\3 posh fe [2x . _ pol? 
2(+) I, i 0 k? dk sin 0 dé d¢@ ~ 352%8 


If all these states are occupied, the number of electrons per unit volume 
whose kinetic energy does not exceed po?/2m is po?/3r*h?. Now the maxi- 
mum kinetic energy at any distance r from the nucleus is — V(r), since 
otherwise electrons would escape from the atom. We thus obtain a rela- 
tion between the volume density of electrons, n(r), and the potential 
energy: 

[—2mV (r)]3 


od a 3r7h?3 


(47.1) 

The electrostatic potential — V(r)/e is also determined by Poisson’s 
equation in terms of the charge density —en(r): 

~ vv = — ae G 4 = 4ren(r) (47.2) 
Equations (47.1) and (47.2) are two simultaneous equations for n and V. 
The boundary conditions on the solutions can be expressed in terms of V 
alone for a neutral atom of atomic number Z. Asr— 0, the leading term 
in the potential energy must be due to the nucleus, so that V(r) — —Ze?/r. 
As r— o, there must be no net charge inside the sphere of radius 7, so 
that V falls off more rapidly than 1/r, and rV(r) > 0. The boundary 
condition at infinity is different from that assumed earlier in this section, 
where V was taken to have the asymptotic form —e?/r. The V discussed 
earlier is the potential experienced by one of the atomic electrons, whereas 
the Thomas-Fermi potential is that experienced by an infinitesimal test 
charge. The difference between the two potentials emphasizes the 
statistical nature of the approximation made by Thomas and Fermi. 
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The solution for V is exact in the limit in which m becomes infinite and 
e becomes zero in such a way that m’e4 remains constant; then the elec- 
tron wavelength becomes zero, and the density of particles becomes 
infinite. In this limit the potential is constant over many wavelengths, 
and enough particles are present so that statistical mechanics can be 
applied. 


EVALUATION OF THE POTENTIAL 
Elimination of n(r) from Eqs. (47.1) and (47.2) leads to an equation for 
—V(r) 
ld{,d(—V)| _ 4e—2mV(r)}! 
r? dr dr ” Srh3 


(47.3) 


Equation (47.3) and the boundary conditions given above are conven- 
iently expressed in a dimensionless form in which Z, E, m, and h appear 
only in scale factors. We put 


Ze? 


V(r) = - ee r = bx 
(47.4) 
—_ 1/3r\! fh? _ 0.885a0 
~ 2\ 4) me?Zi ZF 
where @) = h?/me*?. With these substitutions, (47.3) becomes 
d*x 
gi —= = yi 
dx? ~ * (47.5) 
x=latzx=0 x=Oatr=0 


The most accurate solution of Eq. (47.5) was computed by Bush and 
Caldwell! with the help of the original differential analyzer, and is 
expressed in the form of a numerical table. 

Equations (47.4) show that the “radius” of an atom is inversely 
proportional to the cube root of the atomic number, if this radius is 
interpreted to be that of a sphere that encloses a fixed fraction of all the 
electrons (see Prob. 1). These equations can also be used to show that 
the Thomas-Fermi approximation improves with increasing Z. The 
potential at the atomic radius is proportional to Z', so that a typical 
electron wavelength is proportional to Z~!. The distance over which 
the potential changes by a definite fraction of itself is proportional to 
the atomic radius, or Z~!. Thus the fractional change of the potential 
in an electron wavelength is proportional to Z~? and decreases with 
increasing Z. Moreover, since the number of electrons is equal to Z, 
the use of the statistical method is better justified as Z increases. 


1V. Bush and S. H. Caldwell, Phys. Rev. 38, 1898 (1931). 
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HARTREE’S SELF-CONSISTENT FIELDS 


The second method for obtaining a central field is due to Hartree.! This 
model assumes that each electron moves in a central field that can be 
calculated from the nuclear potential and the wave functions of all the 
other electrons, by assuming that the charge density associated with an 
electron is —e times its position probability density. The Schrédinger 
equation is solved for each electron in its own central field, and the 
resulting wave functions made consistent with the fields from which 
they are calculated. Thus the kth electron is described by a normalized 
wave function %,(r;) that is a solution of the equation 


[- ~ vo = — “+ 2, J |us(rs)|? = ane = a,| Un(te) = exur(te) (47.6) 


where rj, = |r; — r,|. If there are Z electrons in the atom, (47.6) con- 
stitutes a set of Z simultaneous nonlinear integrodifferential equations 
for the Z functions u,(r,). It is therefore not feasible to solve these equa- 
tions directly, and Hartree used a method of successive approximations. 

A potential energy that approximately represents the second and 
third terms in (47.6) is assumed, electron wave functions computed, and 
new potentials for each electron found from these wave functions. This 
process is continued until the potentials are self-consistent to a high order 
of accuracy. The principal approximation made is the averaging of the 
potential energy given as the third term in (47.6) over the angles of 
r, to make it spherically symmetric. The solutions of (47.6) can then be 
expressed as products of radial functions and spherical harmonics. A 
further simplification is made so that the 2(2/ + 1) or fewer electrons 
in a shell all move in the same potential and have the same radial wave 
function. 

It is apparent that the Hartree approximation neglects correlations 
between the positions of the electrons, since the entire wave function 
for all the electrons is assumed to be a simple product of one-electron 
functions 


(OL T2, . + + Ez) = Us(ti)ue(te) - - * uz(tz) (47.7) 


It is also clear from (47.7) that antisymmetrized wave functions are not 
employed. The antisymmetry is considered only insofar as the quan- 
tum numbers of the one-electron states u, are chosen in agreement with 
the exclusion principle.’ 


1D. R. Hartree, Proc. Cambridge Phil. Soc. 24, 111 (1928). 


* Antisymmetrization is taken into account in the Hartree-Fock method ; see, for 
example, Slater, op. cit., chap. 17. 
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CONNECTION WITH THE VARIATION METHOD — 


We now show that the Hartree approximation results from an optimum 
variation calculation with the trial function (47.7).1. The wave equation 
with inclusion of interelectronic interactions but neglect of spin-orbit 
terms (see below) is 


Hy = ky 
_v(_ i, _ Ze e (47.8) 
H= é ( om V2, Tk ) ss ny) ik 


where 7 > k implies a double summation over all different pairs of indices 
j and k. We wish to minimize the expectation value of H. 
From (47.7) and (47.8) we obtain 


/ oe [vn dr, oe arz 
a 2 J ug (Te) (- La Ve *) Ux(Te) ar, 
+ SY ff upenutcen) = usleruales) dtr dire (47.9) 
j>k : 


since the % are normalized. The optimum y is obtained by varying 
each of the wu separately to minimize (47.9). The only dependence of 
(47.9) on a particular one-electron function u, is through the terms 


2 2 
i uz (Te) (- ue Ves *) un(t,) dry 
; 2 
- 2, / / uj (tj)ux (Te) = Uj(Ij)UK(Ie) dr; dry 


= f uk(t:)Hu(t,) dr, (47.10) 


h? Z 


2 2 
H= ae Ds i: luj(x,)2| = der, 
k ! Vik 
jJ#k 


~ 2m r 


The integral in (47.10) is the expectation value of the operator H; 
for the function u,. From the discussion of Sec. 32, it follows that this 
is a minimum when x is an eigenfunction of H; that corresponds to its 
lowest eigenvalue «. 


Ayu = €,UK (47.11) 


Since Eqs. (47.11) and (47.6) are identical, we see that the Hartree wave 
1J. C. Slater, Phys. Rev. 35, 210 (1930); V. Fock, Z. Physik 61, 126 (1930). 
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functions are the best from the point of view of the variation method 
that can be written in the form (47.7). 

The energy associated with this wave faneuons is just the integral 
(47.9), which can be written with the help of (47.6) 


| << [ viv arr, st dng =)« 
= >) / ‘| ua) eure)? ar dr, (47.12) 
j>k 


The electrostatic interaction terms between electrons are counted twice 
in the summation over « and so have to be subtracted out to give (47.12). 
Thus the energy of the atom is not just the sum of the «,, although each 
«. is roughly the energy of removal of the kth electron. This last is not 
strictly true, since the removal of an electron alters the self-consistent 
fields and hence the wave functions and e’s for the remaining electrons. 
However, « is found to be an especially good approximation to the energy 
of removal in the case of an inner electron (x-ray level). 


CORRECTIONS TO THE CENTRAL-FIELD APPROXIMATION 


We now turn to the second problem mentioned at the beginning of this 
section, the correction of the approximate results obtained from the 
central field. Two terms are omitted in the central-field approximation: 
the difference between the actual electrostatic interaction between elec- 
trons and the average interaction that is included in the central field, and 
the spin-orbit energy. The latter is an interaction energy between the 
spin and the orbital motion of each electron and has the form 


y E(re)Lx - Sk (47.13) 
i 


Here, L;, is the orbital angular momentum operator r, X p, of the kth 
electron and has the properties of the J operator introduced in Sec. 27; 
the eigenvalues of L,? and Z,, are given in terms of the quantum numbers 
land m, for the kth electron as l(l + 1)h? and mh, respectively. SS, is the 
spin angular momentum 3f¢, of the kth electron that was introduced in 
Sec. 41. The function &(r) is given by’ 


1dV 


E(r) = —— a (47.14) 


in terms of the central-field potential energy V(r). 


1L. H. Thomas, Nature 117, 514 (1926). This energy is a consequence of rela- 
tivity and is derived as such in Chap. 13. It was first obtained from the precession 
of the spin axis of the electron, part of which is of electromagnetic origin (Larmor 
precession) and part of which comes from relativistic kinematics (Thomas precession). 
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In considering the effects of these terms, we shall assume that the 
perturbed eigenfunctions, which are linear combinations of various con- 
figuration wave functions, have only negligibly small amounts of all but 
one configuration mixed in them. From Eq. (31.10), it is apparent that 
this is the case if the interconfiguration matrix elements of the perturba- 
tion are small in comparison with the energy intervals between unper- 
turbed configuration energies. 

It can be shown that the part of the summation in (47.13) that 
includes electrons in full shells is zero, since the function & is the same 
for all electrons in a shell and the contributions from electrons with 
opposite m, and m, cancel. Thus the electrons in full shells can be 
ignored and the summation extended only over the remaining electrons. 
The case in which there is just one electron outside full shells is of interest 
in connection with the ground state and low excited states of the alkali 
atoms; it will be discussed in some detail in the next section. For the 
present, we consider very briefly the more general situation, always 
assuming that each atomic state is based on just one configuration of 
the electrons. 


LS COUPLING SCHEME 


There are, in general, a number of states that belong to the same con- 
figuration and that are degenerate in the central-field approximation. 
These states differ in the assignment of m, and m, quantum numbers 
to the individual electrons. The theory of complex spectra consists in 
determining the linear combinations of such suitably antisymmetrized 
wave functions that diagonalize the perturbation to first order (see Sec. 
31), along with the corresponding perturbed energy levels. 

The most usual situation is that in which the hitherto-neglected 
electrostatic terms are larger than the spin-orbit energy; this is called 
the Russell-Saunders case.1 States of the same configuration can be 
classified as eigenfunctions of any dynamical variables that commute 
with the hamiltonian and hence are constants of the motion (see Sec. 24). 
When all perturbations are included, the only true constants of the motion 
are the total parity and the total angular momentum J of the electrons 


J=L+S=) (i + S&) (47.15) 
k 


J is a constant because the angles that specify the orientation of the 
atom as a whole, and that are the canonically conjugate variables to the 
components of J, do not appear in the hamiltonian of an isolated atom. 
When the electrostatic perturbation is included but the spin-orbit energy 
neglected, the same argument can be applied to show that the total orbital 


1H. N. Russell and F. A. Saunders, Astrophys. J. 61, 38 (1925). 
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angular momentum L and the total spin angular momentum S are 
separately constants of the motion. The individual S; need not be 
constants, even though no spin-dependent forces act in this approxima- 
tion, since the use of antisymmetric wave functions couples the spins to 
the electrostatic energy (see the discussion of the excited states of helium 
in Sec. 41). 

A state can be specified by the quantum numbers, J, L, S, M, M1, 
and Ms, which are connected with eigenvalues of angular momentum 
operators through 


PaII +10 J, = Mh 
L? = L(L + 1h? L, = Mth (47.16) 
S? = S(S + 1h? S, = Mgsh 


When the spin-orbit energy is neglected, the electrostatic energy separates 
states of different L; in some cases, only particular S values are permitted 
because of the exclusion principle. Only two of the other four quantum 
numbers are independent, and so we can use either LSM,M 5 or LSJM to 
specify a state. Because of the spherical symmetry of the hamiltonian 
with respect to its space and spin parts separately, the energy is inde- 
pendent of the directional quantum numbers M, and Ms, and there are 
(2L + 1)(2S + 1) degenerate states. For given LZ and S, the states 
specified by J and M are linear combinations of those specified by Mz, 
and Ms, so that the same amount of degeneracy appears in the LSJM 
representation; these linear combinations can be expressed in terms of the 
Clebsch-Gordan coefficients discussed in Sec. 28. This is called the LS 
coupling scheme, since the individual L; are coupled together to form the 
total L, and the individual S; to form the total S. 

If now the spin-orbit energy is included, L and S are no longer con- 
stants of the motion, although J and M still are. However, we assume 
that states of different L and S are sufficiently well separated by the 
electrostatic energy so that their mixing due to spin-orbit energy can be 
neglected. This is analogous to the earlier assumption that different 
configurations are sufficiently well separated by the central field so that 
their mixing due to the electrostatic energy can be neglected. States of 
different J in the LSJM representation are now split apart by the spin- 
orbit energy; the energy is still independent of M, so that there are 2J + 1 
degenerate states. A Russell-Saunders state is usually written in the 
form ‘D,, where the superscript is the multiplicity 2S + 1, the letter 
(now capitalized) is the L value according to the code given earlier in this - 
section, and the subscript is the J value; in this case S = 3, L = 2, and 
J=%. Since J = L +S, the triangle rule discussed at the beginning of 
Sec. 28 shows that J can only be one of the numbers L + S,L + S — 1, 
...,(D—S|. 
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SELECTION RULES 


The selection rules in the Russell-Saunders case can be obtained from the 
discussion of Sec. 46. Only one electron is involved in a transition, so 
that in an allowed transition the configuration changes through a change 
in one of the l’s by one unit; this also changes the parity. Since the 
electric dipole moment does not involve the spins, and the spin functions 
for different S are orthogonal (see Prob. 4), S does not change in an 
allowed transition. The conservation of angular momentum between 
atom and radiation field further requires that J and L each change by 1 
or 0. Tr..nsitions between states both of which have J = 0 are strictly 
forbidden. 

Intersystem lines that join states of different multiplicity (change in 
S) sometimes occur and indicate a partial breakdown of LS coupling. 
The very intense mercury resonance line at 2537 A is an intersystem line: 
3P,;— 18>. This transition is allowed so far as the changes in J, L, 
configuration, and parity are concerned but not as regards the change in 
S. The *P; state is partially mixed by the spin-orbit energy with a 
higher singlet (S = 0) state of the same J and parity, and this makes an 
electric dipole transition possible. 


jj COUPLING SCHEME 


The opposite approximation to that involved in LS coupling assumes that 
the spin-orbit energy is large in comparison with the electrostatic energy. 
If the latter is neglected, each electron can be characterized by the quan- 
tum numbers nljm rather than nlmm,, where (Li + Si)? = j(7 + 1)h? and 
Liz + Ste = mh. The electrostatic energy then splits apart states of 
different J. 

This is called the 77 coupling scheme, since the orbital and spin angu- 
lar momenta of the individual electrons are coupled together to form j’s, 
from which the states are built up. It is mainly of interest in heavy 
atoms, where the large V(r) makes the spin-orbit energy (47.13) the 
dominant perturbation. 


48U0THE ALKALI ATOMS 


The ground-state configuration of an alkali atom consists of a series of 
full shells followed by a single s electron and so is 78,._ The inner rare-gas 
configuration is so stable that all but quite high excited states of the atom 
involve only the valence electron. Thus the alkalis can be treated to 
quite good approximation in terms of a model in which a single electron 
moves in a spherically symmetric noncoulomb potential energy V(r). 
In this section we calculate the energy levels and the intensities of allowed 
transitions in the absence and presence of an external magnetic field. 
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DOUBLET SEPARATION 


The configuration of an alkali atom can be specified by a single pair of 
quantum numbers nl. Since there is only one electron, the perturbing 
electrostatic term mentioned in the preceding section does not appear. 
In the absence of external fields the hamiltonian, including the spin-orbit 
energy (47.13), is 

i x V+ V(r) + ELS (48.1) 
where £(r) is given by (47.14). As in Sec. 47, we neglect the mixing of 
different configurations produced by the spin-orbit energy and regard 
this term as a perturbation that removes the mm, degeneracy within each 
configuration. The total angular momentum J = L + S of the valence 
electron is a constant of the motion (see Prob. 5), so that states can be 
designated by jm instead of mym., where J? = j(j + 1)h? and J, = mh. 
The states of different j have different energies, but there is still a (27 + 1)- 
fold degeneracy due to m. The removal of the m degeneracy by a mag- 
netic field is discussed later in this section. 

The difference in energy between states of different 7 is due to the 
L-S term in (48.1) and can be found from its expectation value or diago- 
nal matrix element [see Eq. (31.8)]. We have the operator relation 


P=(L+S)?=L?+8?+2L-S (48.2) 


Since J, j, and s are all good quantum numbers (s = 4 for one electron), 
Eq. (48.2) can be solved for the diagonal matrix element of L- 8S: 


(j|L > Sllj) = 300 + 2) — 10 + 1) — a? (48.3) 


Now if l is different from 0, 7 can be either! + Zor! — 3. Thus the first- 
order perturbation arising from é(r)L- S is 


Ufa heen 
—3 + Ign ifj7=1-—2 (48.4) 
ta = i [Raa(r)|2(r) r2dr 1>0 


where R,:(r) is the normalized radial part of the unperturbed eigenfunc- 
tion associated with the nl configuration. Since V(r) represents an 
attractive potential energy, &(r) given by (47.14) is positive and f,7 is 
positive. Thus (48.4) shows that the state with higher 7 has the higher 
energy. The pair of states is called a doublet; the doublet structure 
characterizes all the moderately excited levels of the alkali atoms except 
those for which J = 0, in which case j can only be 3. 

The doublet separations can be calculated from (48.4) if the radial 
function is known. We can get a rough estimate of their dependence on 
n by using the hydrogenic wave functions given in Eq. (16.24) and assum- 
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ing that V(r) has the coulomb form —Ze?/r. Substitution into (47.14) 
and (48.4) gives, with the help of the generating function (16.21) for the 
associated Laguerre polynomials, 


2702 peo 
ta = EE [ol Rao a 
e*n?Z4 

~ 2mc?ap*nF(l + 2)(1 + 1) 
This is valid only for 1 > 0; the singularity in &(r) at r ='0 makes the 
integral for {no diverge there, so that the perturbation approximation is 
not valid. It follows from (48.4) and (48.5) that the doublet separation is 
proportional to n~’, and this is in fair agreement with observation. The 
absolute value of the doublet separation and its dependence on / are not 
given at all by this simple theory, since the effective Z is difficult to 
estimate and depends markedly on / because of penetration.! 


(48.5) 


DOUBLET INTENSITY 


We now calculate the relative intensities of the two lines of the allowed 
doublet ?P,— 7S, and ?P,— *8,, under the assumption that the radial 
wave functions are the same for the two excited ?P states. Transitions of 
this type give rise to the principal series in the alkali spectra. From Eq. 
(45.22), the spontaneous transition probabilities, and hence the observed 
intensities if the two P states are equally likely to be occupied, are propor- 
tional to the squares of the dipole matrix elements.’ 

The dependence of the two excited ?P states and the ground 7S state 
on the angular and spin coordinates of the electron is obtained by finding 
linear combinations of products of the four spherical harmonics Y1,1(6,¢), 
Y1,0(0,¢), Y1,-1(0,6), and Yo0(6,¢), and the two spin wave functions (+) 
and (—), that are eigenfunctions of J? and J, (see Secs. 14 and 41 for dis- 
cussion of the angle and spin functions). These combinations can be 
obtained from the Clebsch-Gordan coefficients given in Eq. (28.13): 


m= $ (+) Via 

2P 2 3-4 24(+) Yio i (—) Yi] 

: =e 3-4[23(—) Vio + (+) Vial 
—$ (—)¥i-1 

, (48.6) 
ap | m= 4 3-H (+) Yio + 2) Ful 

: —% 3-4(—) Yio — 24(+)¥i i 
; m= (+) Yo 
ss a (—)Yo.0 


1 The effect of the spin-orbit interaction on the energy levels of hydrogen is found 
in the next chapter as part of an exact relativistic calculation. 

2 The energy difference between the two upper states is so small that the w? factor 
in (45.22) does not affect the intensity ratio appreciably. 
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The wave functions (48.6) can be used to calculate the matrix ele- 
ments of x = rsin 6 cos ¢, y = rsin 6sin ¢, andz = reos 6. We assume 
that the radial functions associated with (48.6) are all the same, so that 
the radial part of the matrix-element integral is a common factor through- 
out. The angle parts of the integrals are easily evaluated by making use 
of the explicit expressions for the Y’s in terms of 6 and ¢ given in (14.16).! 
The products of spin functions follow the simple rules (+)'(+) = 1, 
(—)t(+) = 0, etc. In this way we obtain the following values for the 
squares of the magnitudes of the indicated matrix elements, expressed in 
units of 45 of the common radial factor: 


m= $tom= 7% |z/? = ly? = 3 lel? = 0 
to | -$[elt = [yl? = kel? = 0 
2 2 2 2 
P'S) gt et = yO 
or ee ne 
, 1 {m=ztom= F [z/? = ly|? = 0 lz|? = 2 
FP ebaghe2 kro 
(48.7) 
Similar results are obtained for the transitions that start from m = —} 


and — $; altogether, they confirm the m selection rules of Sec. 46. 

It follows from (48.7) that the sum of the intensities of all the lines 
that originate on each of the four ?P, states is equal to 6, in the above 
units. It is to be expected that these sums are equal since the four values 
of m differ only in the orientation of the angular momentum, and this 
should not affect the intensity. However, the total intensity from each 
of the two *P, states is also equal to 6. The equality of total intensities 
from each state formed from a given L and S is a general property of LS 
coupling; this makes the observed intensity, which is that from all the 
states that are degenerate with respect to m, proportional to 2J + 1.2 
In the example considered here, the two lines of the doublet have inten- 
sities in the ratio 2:1. This is observed for the lowest doublets of the 
alkalis, although for the higher doublets the intensity ratio exceeds 2. 
This is because the spin-orbit energy actually mixes different configura- 
tions (?P states with the same 7 but different »); the amount of mixing is 
different for the two j values, so that the two radial functions are not the 
same. A small admixture of the low-intensity upper states in the high- 
intensity lowest ?P states has little effect, whereas in the opposite case 
there is a large effect on the doublet intensity ratio.’ 


1 In the general case in which Y1,’s with | > 1 are involved, it is often easier to use 
a formula for the integral of the product of three spherical harmonics given by J. A. 
Gaunt; see Condon and Shortley, op. cit., p. 176. Gaunt’s formula can be derived by 
making use of the Clebsch-Gordan coefficients. 

2 Condon and Shortley, op. cit., p. 238. 


+E. Fermi, Z. Physik 69, 680 (1929). 
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EFFECT OF A MAGNETIC FIELD: 


We now consider the effect of a magnetic field on the energy levels and 
transition intensities of an alkali atom. As discussed in Sec. 31, a con- 
tant magnetic field can be represented by the vector potential 


A=j3Hxr (48.8) 


and the extra energy associated with the orbital motion of an electron of 
charge —e is 


e€ 
2mc 


2 
H-L+ I H?r? sin? 6 (48.9) 
where L = r X p and 6 is the angle between r and H. 

The electron also has an intrinsic magnetic moment in the direction 
of its spin axis. The magnitude of this moment can be determined from 
comparison between experiment and the theory of the Zeeman effect 
presented below and is in agreement with the value deduced from Dirac’s 
relativistic theory of the electron (see Chap. 13); it is —eh/2mc, or —e/me 
times the spin angular momentum of the electron.? This is twice the 
ratio of magnetic moment to angular momentum of a classical charge 
distribution for which the ratio of charge to mass density is constant. 
The magnetic moment operator is —(e/mc)S, and the extra energy in a 
magnetic field is 


é 
ae H-s (48.10) 


The ratio of (48.9) to the kinetic energy is quite small for magnetic 
field strengths commonly attainable in the laboratory (see Prob. 6). It is 
therefore permissible to use perturbation theory to find the effect of the 
H terms on the wave functions and energy levels. In most cases, only 
the linear terms need be considered. However, for very strong fields and 
large orbits, the quadratic terms can become of interest (see the dis- 
cussion of the quadratic Zeeman effect below). Also, the diamagnetic 
susceptibility can be obtained from the terms in the energy that are 
proportional to H?. 


WEAK-FIELD CASE 


For the present, we consider only the first-order effects of H. The 

hamiltonian (48.1) then becomes, with (48.9) and (48.10), 
2 

= — Fv 4 Vir) + E)L-S + e(Le + 28,),€ = 


~ 2m 


1W. Heisenberg and P. Jordan, Z. Physik 37, 263 (1926). 


2 The quantity e4/2mc is called the Bohr magneton, and is equal to 0.927 X 10-7° 
erg /oersted. 


eH 


San (48.11) 
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where the field is along the z axis. The magnetic field can now be clas- 
sified as weak or strong according as the last term in (48.11) is small or 
large in comparison with the spin-orbit energy. The Zeeman effect usu- 
ally refers to the weak-field case, and the Paschen-Back effect to the 
strong-field case, although the term Zeeman effect is sometimes used to 
include all magnetic effects. 

In the weak-field case, we can make use of the wave functions (48.6), 
which are eigenfunctions of J? and J,. It is easily verified that the mag- 
netic energy «e(L, + 2S,) = «(Jz + S,) has matrix elements between states 
of different j but not between states of the same j and different m. We 
neglect the former, because of the relatively large energy separation 
between states of different 7. Thus the magnetic energy is diagonal with 
respect to m for each j and shifts the energy of each of the states (48.6) 
by its expectation value for that state. In each case, J, is diagonal, and 
so its expectation value is mh. The expectation value of S, for the *P; 
state with m = 4, for example, is 


Sf3-4[24(+)1V Fo + (—)t Yh lkho.3-424(+) V1, 
+ (—)¥ 13] sin 6 dé de 


= aff [2 +)t¥#, + (—)t¥F124(+) Pio 
— (—)¥i,] sin @ dé d¢ 


with the help of (41.5) and the orthonormality of the spin functions and 
the Y’s. Thus the magnetic energy of this state is «h($ + 3) = $eh. 
This and the similar results for the other states (48.6) can be expressed in 
terms of the Landé g factor; the magnetic energy is . 


emhg 
g = 4 for *P, g = § for ?P, g = 2 for *8, (48.12) 


The weak-field transition intensities are given directly by (48.7). 
According to the discussion of Sec. 46, the radiation from the transitions 
in which m changes by unity is circularly polarized when viewed along 
the field and linearly polarized perpendicular to the field when viewed in 
the zy plane; these are called the o components (from the German senk- 
recht). When m does not change in a transition, the radiation does not 
appear along the field and is polarized parallel to the field (7 components) 
when viewed in the zy plane. For observation in the zy plane, the 
intensity is proportional to |z|? in (48.7) and the o intensity is proportional 
to either |x|? or |y|? (but not to their sum). 
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STRONG-FIELD CASE 


If the magnetic energy is large in comparison with the spin-orbit energy in 
(48.11), the field is said to be strong. In this case the states within a 
given nl configuration are better specified by m, and m, than by j and mas 
in (48.6). The magnetic energy is then diagonal and has the value 


ch(m, + 2m,) (48.13) 


If the spin-orbit energy is neglected for the moment, the eight wave 
functions that correspond to (48.6), and their energy shifts (48.13), are 


(+) ¥i1 2eh 

(+) Yio eh 

5 (+) ¥i-1 0 

ee ae 0 
(—) Fie dh (48.14) 

Ey ae — Qh 

2 (+) ¥o,0 ch 

8 | (6. ath 


In the event that the magnetic field is very strong, the spin-orbit 
energy is most simply treated as a perturbation on the wave functions 
(48.14). We consider instead the general case, which includes all relative 
magnitudes of the magnetic and spin-orbit energies. This is done by 
working with the matrix of the last two terms in (48.11) in either of the 
representations (48.6) or (48.14). The eigenvalues of the matrix are the 
energy levels, and the transformation that diagonalizes the matrix gives 
the wave functions in accordance with the discussion of Sec. 31. We 
start from (48.14), and notice at once that the two 2S wave functions are 
the same as the 7S, functions of (48.6). We ignore the effect of the spin- 
orbit energy on these two states, since it does not shift them with respect 
to each other; the energy shifts due to the magnetic field are teh. Simi- 
larly, the first and last of the six ?P wave functions are the same as the 
*P, functions of (48.6) with m = +#; their energies are 3{ + 2eh, where 
¢ is given by (48.4). 

The four remaining *?P wave functions combine in pairs, according to 
whether m = mu + m, is equal to 4 or —3. It is enough to consider 
just one of these pairs, say that for which m = 3: (+)Yi,o and (—) Yi. 
The matrix of the magnetic and spin-orbit energies in the representation 
specified by these two states can be found with the help of the angular 
momentum matrices (27.26): 


ch 2-8 
Ee it (48.15) 
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In accordance with the discussion of Eq. (22.21), the eigenvalues of the 
matrix (48.15) are found by solving the secular equation 


ch — Xd 2-t¢ _ 
Q-te ap =N+ Gf —h)A—H(A+5) =0 
In this way we obtain for the energy shifts of these two states 
he = leh — dE + (CA? + AE + Be) (48.16) 


In the weak- and strong-field limits, the upper and lower signs in (48.16) 
lead to 


Ay > 43o + Beh and A > —o + Feh for * 0 
Ay — eh and A> - HE for & —0 
(48.17) 


Equations (48.17) show that the state that corresponds to the upper sign 
in (48.16) is the weak-field state 7 = $, m = % and the strong-field state 
m, = 0, m, = 3. Similarly, the lower sign in (48.16) corresponds to the 
weak-field state 7 = 3, m = and to the strong field state m = 1, 
Mm, = —%. 
The transition intensities can be found in the general case by calcula- 
ting the matrix elements of x, y, and z with the help of the eigenfunctions 
of e(r)L-S + e(L, + 2S.). These eigenfunctions are the first, sixth, 
seventh, and eighth of (48.14) and linear combinations of the other four 
functions that are obtained from the matrix that diagonalizes (48.15). 


QUADRATIC ZEEMAN EFFECT 


For very strong magnetic fields and large orbits or 7 values, effects of 
second order in H become appreciable. From (48.5), it is apparent that 
the effect of the spin-orbit energy becomes very small for large n, and a 
useful approximation is obtained by neglecting this part of the energy 
entirely. In this case the electron spin commutes with the hamiltonian, 
so that m, is a constant of the motion, and the spin can be ignored. The 
hamiltonian (48.11) is then replaced by 

h2 


V2 + Vir) + els + ime’r? sin? 6 (48.18) 
Since L, = —ihd/8¢ commutes with (48.18), mis a good quantum num- 
ber, and the only effect of the term «L, is to displace each energy level by 
the amount efm;. Thus for large n we need be concerned only with the 
effect of the last term H’ = }me?r? sin? 6 in (48.18), for particular values 
of m and ™,.! 


11. I. Schiff and H. Snyder, Phys. Rev. 55, 59 (1939). 
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It follows from the work of Sec. 16 that the effective radius of a 
hydrogen atom is roughly proportional to n?._ For alkali-atom states of 
large n, V(r) has practically the coulomb form, and the wave functions are 
very nearly hydrogenic functions. Thus H’ increases about as n‘4. This 
means that 7 is no longer a good quantum number for sufficiently large n. 
For smaller n, 1 may not be a good quantum number since H’ has off- 
diagonal matrix elements between states of different 1, and the unper- 
turbed energies of these states lie close together (they fail to be degenerate 
only because the wave functions for the smallest values of J penetrate 
the inner full shells). In this region, the perturbed energy levels can be 
found by diagonalizing the matrix of H’ for given values of n, m, and 
m., when the n — |m,| rows and columns are labeled by I. The structure 
of the H’ matrix can be inferred from Gaunt’s formula (footnote 1, page 
439) ; since sin? 6 can be expressed in terms of spherical harmonics of order 
0 and 2, the only nonvanishing matrix elements (/|H’|l’) = Hj, are for 
1—l'=0, +2. Thus the H’ matrix has the form (if, for example, 
m = 0) 


0 nu O Ay 0 
(48.19) 


The matrix (48.19) is equivalent to two independent matrices, one 
for even and the other for odd /, each with about 3n rows and columns. 
Direct diagonalization of these would be quite arduous for large n. How- 
ever, the resulting energy levels are so close together that they cannot be 
resolved spectroscopically, so that there is little reason to determine the 
individual levels. What can be observed is the aggregate of transitions 
between the *S ground state (J = 0) and the group of states that are 
obtained by diagonalization of (48.19); these appear to be a single 

broadened “‘line’”’. Allowed transitions occur only by virtue of the state 
with 1 = 1 that is mixed into each of the eigenfunctions of (48.19), so that 
m, can be only 0 or +1. This makes it possible to find the center of 
gravity of this line and its mean-square breadth without diagonalizing 
H’, as we now show. 

The unperturbed wave functions can be chosen so that H’ is a real 
matrix. Then the unitary matrix S that diagonalizes H’ can be real, so 
that Eq. (22.23) or Eq. (23.12) can be written 


SH'St = E (48.20) 
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where S* is now the transpose of S and E is diagonal. In terms of matrix 
elements, this equation is 


» SuxHSa = E65; 
kl 


The new eigenfunctions u; that correspond to the energy eigenvalues EF; 
are given in terms of the unperturbed wave functions 1 by (23.6): 


uy >= » Sar: 
l 


If now we neglect the dependence of the radiative transition probability 
on energy over the small range of energies involved in this group of states, 
the transition probability is proportional to the square of the amount of v1 
in each u;, or to Si?. Thus the energy levels E; should be weighted in 
proportion to S,,?.. The center of gravity of the group of perturbed 
energy levels is given by 


Ew = » ESi? = Hin 


since (48.20) can be inverted to give H’ = S'tES. In similar fashion, the 
the mean-square breadth of the line is 


5 (E; oo Egy)*Si1? =. > E278? — Ejay? 
= > Ai _ Ey? = HY; 
U 


Thus only two of the matrix elements of H’ need be calculated. It is 
apparent that both the displacement (apart from the factor ehm,) and the 
breadth of the line are proportional to H?. 


49C/MOLECULES 


Molecules are considerably more complex in structure than atoms, and 
correspondingly less has been accomplished in the quantitative application 
of quantum mechanics to molecular problems. In this section, a general 
account of the nature of molecular energy levels is followed by a simple 
explicit calculation for the hydrogen molecule and a somewhat more 
general treatment of diatomic molecules. 


1 For more detailed discussions, see G. Herzberg, ‘‘Molecular Spectra and Molecular 
Structure,” 2d ed. (Van Nostrand, Princeton, N.J., 1950); L. Pauling and E. B. Wil- 
son, Jr., “Introduction to Quantum Mechanics,” chaps. X, XII, and XIII (McGraw- 
Hill, New York, 1935); J. C. Slater, “Quantum Theory of Molecules and Solids,” 
vol. 1 (McGraw-Hill, New York, 1963). 
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CLASSIFICATION OF ENERGY LEVELS 


The simplifying feature that is the basis of all molecular approximations 
is the large ratio of nuclear mass to electron mass. As we shall see shortly, 
this implies that the energy associated with the motion of the nuclei is 
much smaller than that associated with the motion of the electrons about 
the nuclei. Since the period of a motion is of the order of h divided by its 
energy, the nuclear periods are correspondingly longer than the electronic 
periods. It is then a good approximation to regard the nuclei as fixed in 
calculating the electronic motion. Moreover, the nuclear motion can be 
calculated under the assumption that the electrons have their steady 
motion for each instantaneous arrangement of the nuclei (adiabatic 
approximation). 

The nuclei are expected to have a stable equilibrium arrangement 
somewhere between a completely collapsed structure (which is unstable, 
since the nuclei are positively charged and repel each other at short dis- 
tances) and a completely dispersed structure (which is not the most stable 
structure if a molecule exists). The nuclear motions can then be classified 
into translations and rotations of the quasi-rigid equilibrium arrange- 
ment and internal vibrations of the nuclei about equilibrium. As with 
atoms, the translational motion is the same as that of a free particle 
[see the discussion of Eqs. (16.5)] and gives rise to no nonclassical features. 

We thus arrive at a classification of molecular energy levels into 
electronic, vibrational, .and rotational types and proceed to estimate their 
relative orders of magnitude. Suppose that the molecule has linear 
dimensions of order a. Then the energy EF, associated with the motion 
of a valence electron (one that occupies roughly the whole of the molecular 
volume, rather than one that is bound in an inner shell close to a nucleus) 
is of order h?/ma?, where m is the electronic mass. This can be inferred 
by an argument like that given near the beginning of Sec. 9, by noting 
that the momentum uncertainty of the electron is at least of order i/a, so 
that its minimum kinetic energy is h?/ma?. We thus obtain 


hi2 


iy~ ma? 


(49.1) 
For values of a of the order of a few angstrom units, this corresponds 
to transition frequencies in the visible and ultraviolet regions of the 
spectrum. 

To estimate the vibrational energy, we regard each of the normal 
modes as a harmonic oscillator with which is associated a mass M and a 
stiffness constant Ky. M will be of the order of a typical nuclear mass. 
Ky can be estimated by noting that a displacement along a normal mode 
by the order of the molecular size a must produce an energy change of 
the order of the electronic energy E., since such a large displacement 
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would produce a substantial distortion of the electronic wave function; 
we thus put Ky ~ E./a?. Then the energy £, associated with a fairly 
low mode of vibration is, from (13.8), 


Ko\t 9 
B.~ (iP) ~ canis ~ ir) ® vee 


where use has been made of (49.1). E,is roughly a hundred times smaller 
than E, and corresponds to transitions in the near infrared. 

The rotational energy E, can be estimated from the moment of 
inertia of the molecule, which is of order Ma?. As would be expected, 
the angular momentum of a fairly low mode of rotation turns out to be 
of order h, so that 

h? m 
Ma M 
This is about a hundred times smaller than E, and corresponds to transi- 
tions in the far infrared. 

It might be expected from Eqs. (49.2) and (49.3) that the electronic, 
vibrational, and rotational energy levels can be obtained as successively 
higher orders in an approximation that is based in some way on the small 
ratio m/M (which is usually in the range 10- to 10-4). This was shown 
to be the case by Born and Oppenheimer.! They used as expansion 
parameter the ratio of a typical nuclear vibrational displacement to the 
internuclear distance (which is of order a). An oscillator of energy E, 
and stiffness constant Ko has a displacement of order 


(x) ~°(&) 


so that the expansion parameter is 


)~(3) ws 


In terms of this, the electronic energy is of zero order, the vibrational 
energy of second order, and the rotational energy of fourth order; the 
first- and third-order energies vanish. 


E,~ az E. (49.3) 


WAVE EQUATION 


The time-independent Schrédinger equation for a molecule is readily 
written down: 


nh? < wae 
(- ae ) V2 — 2M, V2 + v)¥- Ey (49.5) 
t=1 


1M. Born and J. R. pene Ann. Physik 84, 457 (1927). 
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There are 7 electrons and N nuclei, and V is the sum of the electrostatic 
interactions between all pairs of them. It is apparent that the nuclear 
kinetic-energy terms are of fourth order in the expansion parameter 
(49.4). If they are neglected, the wave function y involves the nuclear 
coordinates R; only parametrically, and (49.5) is a wave equation in 
the r; for the motion of the electrons with respect to nuclei that are fixed 
in space. In this case, y is approximately ug,(r;) and corresponds to 
the energy eigenvalue U(R;). The nuclear motion can then be found by 
regarding U(R,;) as a potential function and using it to obtain a nuclear 
wave function w(R;,). 
We therefore write y in the form 


V(r,R;) = Up, (ti)w(Rs) (49.6) 


where wu satisfies the equation 


2 n 
(- = 2 Voon v) ur, (Ti) = U(R;) ur, (ti) (49.7) 
For each arrangement of the nuclei, U(R,) is obtained as an eigenvalue 
of Eq. (49.7). There will, in general, be several solutions that corre- 
spond to different electronic states of the molecule; care must be taken to 
ensure that wu and U change continuously with R,, especially if the system 
is degenerate. Substitution of (49.6) into (49.5) gives, with the help of 
(49.7), 


N He 
| - > ou? + ve) |v = Ky 
j=l” 


which can be rewritten 


N 2 
ur, (Ii) | ~ ») aT, V7? + U(R;) —£E w(R;) 


N 
hi? 
=) saz RIV ual) + 2V0(R) * Viur(n)] (49.8) 
j=l” 
If now the dependence of u on R; is neglected, the right side of (49.8) 


drops out, and an approximate wave equation for the nuclear motion is 
obtained : 


N 2 
[- y 2 vie ue) | w(R,) = Ew(R,) (49.9) 
j=l” 


The neglect of the Vju terms derives physically from the smallness of 
the amplitudes of the nuclear motion in comparison with the equilibrium 
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internuclear distances [smallness of the expansion parameter (49.4)]; 
this implies that the electronic part u of the wave function does not change 
much as the nuclei move. Born and Oppenheimer showed formally that 
this approximation is justified so long as not too high vibrational and 
rotational modes are excited. 


THE HYDROGEN MOLECULE 


It is clear from the foregoing discussion that two distinct problems arise 
in connection with molecular structure. The first is the solution of Eq. 
(49.7) to obtain electronic wave functions and a potential energy function 
U(R,;) of the nuclear coordinates. The second is the solution of (49.9) 
for the nuclear motion. The first problem can be solved only in the 
simplest cases. As an example, we now consider in outline an approxi- 
mate solution for the hydrogen molecule due to Heitler and London.! 
Following this, we discuss the solution of Eq. (49.9) for a general diatomic 
molecule, making simple assumptions concerning the potential energy U. 

The only nuclear coordinate R; that appears in (49.7) in the case of 
the hydrogen molecule is the magnitude FR of the distance between the 
two hydrogen nuclei. The hamiltonian is that given in Eq. (32.11) (see 
Fig. 27); however, # is no longer large in comparison with ao = h?/me? 
so that the approximations implied in (32.11) and (32.12) are no longer 
useful. Nevertheless, an approximate wave function based on a simple 
product of two ground-state hydrogen-atom functions gives remarkably 
good results. The reason for this is that exchange degeneracy (see Sec. 
40) is taken into account; the degenerate wave functions for which elec- 
tron 1 is on nucleus A and electron 2 on nucleus B, and for which electron 
1 is on nucleus B and electron 2 on nucleus A, are both used at once. 
The new feature of the work of Heitler and London was the recognition 
that an appropriate linear combination of unperturbed degenerate wave 
functions gives a significantly lower energy than the separate wave 
functions; it is the basis of the present-day theory of homopolar binding 
in molecules. This property of degeneracy is sometimes referred to as 
resonance. An analogous situation is that in which an interaction 
between two classical oscillators that are in resonance (same unperturbed 
frequency) gives rise to a normal mode that has a lower frequency (and 
also one that has a higher frequency). In a similar way, an interaction 
between two resonant (degenerate) states in quantum mechanics gives 
rise to a lower energy eigenvalue (as well as to a higher one).2 There 
may of course be more than two degenerate unperturbed states, and the 
degeneracy need not be of the exchange type. 


1W. Heitler and F. London, Z. Phystk 44, 455 (1927). 


2 This use of the word ‘‘resonance”’ is only remotely related to that which appeared in 
Sec. 19 in connection with scattering. 
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POTENTIAL-ENERGY FUNCTION 
Equation (49.7) for the hydrogen molecule is 


[H — U(R)]ur(ti,r2) = 0 
i? 2 2 
H = — om (Vi? + V2?) 
+e (F A Ge ts ee =) (49.10) 


We wish to base our approximate calculation of U(R) on the approximate 
wave functions 


Ui(¥1,%2) = wa(i1)UB(T2) 


Uo(f1,02) = Ua(te)Ua(1) (49.11) 


where ua and ug are ground-state hydrogen wave functions [woo in the 
notation of (16.24)] based on nuclei A and B, respectively. It must 
first be noticed that the wu: and we of (49.11) are eigenfunctions of different 
unperturbed hamiltonians, so that the degenerate perturbation theory 
of Sec. 31 is not applicable. This is unlike the situation with the helium 
atom [see the discussion of (41.7)], where the two exchange-degenerate 
wave functions are solutions of the same unperturbed hamiltonian. 

We can, however, use the variation method of Sec. 32, in which case 
it is natural to adopt as the trial function an arbitrary linear combination 
of wi and ue 


¥(t1,02) = ui(ti,62) + Aue(ri,r2) (49.12) 


where A is the variation parameter. Substitution of (49.12) into (32.5) 
gives 
(1 + A*2)Hy, + 2AH - 
U(R) < a eo 7 ae y = SSurue dr; dire 
Ay = Hy = ffui:Huid*ri d*r. Hy = Aa 

= ffu:Huzd*r,d*r_, (49.13) 


These equalities between matrix elements are easily established with the 
help of Eq. (23.21) when it is remembered that the w’s are real and that 
H is hermitian and symmetrical in the two electrons. 
The matrix elements and y depend on R. For any particular value 
of R, the derivative of the right side of (49.13) with respect to A is 


2(1 — A’)(Ai — vHu) 
(1 + A? + 2Ay)? 


which is zero for A = +1. Since the right side of (49.13) is equal to 
Hy, when A is — ©, 0, and +, one of the points A = +1 must bea 
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minimum and the other a maximum. The integrals in (49.13) can be 
expressed in terms of tabulated functions, and the minimum expectation 
- value of H is obtained with A = +1: 


y= ut Use 
Hut Ai (49.14) 
U(R) < =Bs 


The upper limit on U(R) given in (49.14) has the general appearance that 
is characteristic of the internuclear potential energy of a diatomic mole- 
cule (see Fig. 39) and is in good agreement with experiment.' Since y in 
(49.14) is symmetric in an interchange of the space coordinates of the 
two electrons, it must be multiplied by the antisymmetric singlet spin 
function given as the last of Eqs. (41.6). 

It is interesting to compare the symmetry characters of the ground 
state of the hydrogen molecule and the excited states of the helium atom 
considered in Sec. 41, from a physical point of view. Because of the 
exclusion principle, electrons must be in different space states if they 
have parallel spins and hence tend to keep away from each other. In the 
excited 1s2s state of helium, for example, this reduces the electrostatic 
repulsion of the electrons and lowers the energy. Thus the triplet states 
of helium tend to lie lower than the singlet state of the same configuration. 
(The situation is different for the ground state, since only the singlet state 
can exist for the 1s? configuration.) In the ground state of the hydrogen 
molecule, on the other hand, the lowest energy (strongest binding) is 
obtained when the electrons tend to concentrate between the two nuclei, 
since then the repulsion between the electrons is more than compensated 
by the attraction of both nuclei for each electron. This occurs when the 
electrons can occupy the same space state and hence when they have 
antiparallel spins. Thus it is the singlet state that leads to a stable 
molecule. 


THE MORSE POTENTIAL 


We now turn to diatomic molecules in general and consider the nature 
of the solutions of Eq. (49.9) for the nuclear motion. If the nuclei have 
masses M, and M, and their relative position vector # has polar coordi- 
nates R, 6, ¢, the equation for the relative motion becomes [see Eqs. 
(16.5)] 


| - ce v2 + ue) | w(R,0,¢) = Ew(R,8,¢) (49.15) : 


where M = M,M;/(Mi + M,) is the reduced mass. 
1 Pauling and Wilson, op. cit., sec. 43a. 
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Fig. 39 The Morse potential (49.16), with a = 4Ro. 


It has been found by experience that the potential-energy function 
for the lowest electronic states of actual diatomic molecules can be repre- 
sented quite accurately by a simple analytic function that contains three 
adjustable parameters 


U(R) = Uo(e-2(2-Fala — Qe-(R—-Ry)/2) (49.16) 


Equation (49.16) represents the Morse potential! and is drawn in Fig. 39. 
U approaches zero exponentially for large R, has the minimum value 
—U, at R = Ro, and becomes large and positive as R approaches zero if 
the “breadth” a of the attractive region is somewhat smaller than the 
equilibrium distance Ro. . 

Figure 39 has the general appearance that would be expected for a 
diatomic molecule. The zero of energy is arbitrarily chosen to occur 
when the neutral atoms are far apart; then U becomes negative at first 
as & decreases, because of the van der Waals attraction.2. For smaller R 
this is replaced by the much stronger Heitler-London resonance attrac- 
tion. As & continues to decrease, the close approach of the nuclei (or 
ionic cores) gives rise to a repulsion that causes U to increase and eventu- 
ally become large and positive.* 


1P. M. Morse, Phys. Rev. $4, 57 (1929). 


? One of the inaccuracies of the Morse potential is its replacement of the 1/R® van 
der Waals term (see the end of Sec. 32) by an exponential; however, the behavior of 
U at such large F# has little influence on molecular energy levels. 


® Unlike the true interaction, the Morse potential is finite at R = 0. 
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ROTATION AND VIBRATION OF DIATOMIC MOLECULES 


Equation (49.15) can be separated in spherical coordinates, as was Eq. 
(14.1), to give 


w(R,0,¢) = xe) Yxu;,(9,¢) 


K and Mx are the angular momentum quantum numbers that are 

analogous to | and m, respectively, for a single particle in a central 
field. The radial equation is 

ee 

2M dk? 


W(R) = U(R) + 


+ W(R)x = Ex 


WK(K + 1) 
2M R? 


Equation (49.17) is the wave equation for the one-dimensional 
motion of a particle of mass M in a potential W(R), with the boundary 
condition that x vanish at R = 0. If K is not too large, the general 
shape of W resembles that of U shown in Fig. 39. In this case, we are 
primarily interested in vibrations of small amplitude about the minimum. 
We can then expand W about its minimum at R,, which is only the same 
as Ry if K = 0, to give 


W(R) = Wo + 4K(R — Ri)? + 0(R — Ri)? + c(R — Ri)* (49.18) 


where higher-order terms are neglected. If the b and c terms are also 
neglected and the domain of R is extended to — ~, the eigenvalues of 
(49.17) are those of a linear harmonic oscillator with an additive term 
W.. This is a good approximation for moderate values of the rotational 
quantum number K and the vibrational quantum number v. A somewhat 
better approximation can be obtained by regarding the b and ¢ terms in 
(49.18) as perturbations on the oscillator. Since the b term produces 
only a second-order effect (see Prob. 2, Chap. 8), whereas the c term 
appears in first order (its expectation value can be computed by matrix 
methods), both make contributions to E that have the same order of 
magnitude. 


(49.17) 
K=0,1,2,... 


ENERGY LEVELS 


The eigenvalues of (49.17) to lowest order in b and ¢ are then 
a Ko\} 1 R26? [15 1\2 7 
co cen) (4) ELEC) + 
3h?c 1\2.. 1 
+ 2M Ky (+ + 3) a ‘| v=0,1,2,... (49.19) 


Wo, Ko, 6, and c can all be expanded in powers of K(K + 1), where the 
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coefficients depend on the parameters of the function U(R). If U has 
the form (49.16), the following expressions can be obtained: 


2 2 
Fe fee eae 


2MR,3U5 

Ot WK(K +1) _ A‘K*(K + 1)%a? 

We SU ace 2MR,? 4M?R°U 5 
(49.20) 

K, — 2U0 _ 3WK(K +1) a (,_ a 

m a? MR,?2a? Ro Ro 

_ Us _ 7U 
alee ¢= i944 


Only enough terms have been retained to give E correctly to second order 
inv +4and K(K + 1). 

The first of Eqs. (49.20) shows that the molecule stretches owing to 
rotation. The second equation is the equilibrium energy — Uo, plus the 
rotational energy to second order. The first-order rotational energy is 
h?K(K + 1)/2Io, where In = MR,? is the moment of inertia of the mole- 
cule about an axis perpendicular to the line joining the nuclei; this energy 
is the same as for a rigid rotator (see Prob. 12). The third equation 
includes the change in the stiffness due to stretching. The corrections 
for stretching in the anharmonic terms b and c can be neglected to this 
order. The second term on the right side of (49.19) can be expanded 
with the help of the expression for Ko to give 


2U\'(  1\[, _ 3@KK +1 af, « 
a Ha) (o+5) [2 IMRU, Ro (2 %)| 


The last two terms in (49.19) give the second-order vibrational energy 


15, 7\ # h? 
(- By ie) ae aoe (0 - 3) o~ 545 (> if 3) (49.21) 
since the constant factors cancel. 
It is apparent that the rotational and vibrational energy levels agree 
in order of magnitude with the estimates made at the beginning of this 
section. As either v or K increases, the spacing between levels becomes 


smaller than that predicted from the simple rigid rotator and harmonic 
oscillator. 


EFFECT OF NUCLEAR IDENTITY 


In the event that the two nuclei of a diatomic molecule are identical, the 
wave function must be symmetric with respect to an interchange of their 
space and spin coordinates if the nuclei have zero or integer spin, or anti- 
symmetric if they have half-odd-integer spin (see Sec. 41). The dis- 
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cussion of Sec. 14 shows that the parity of the nuclear wave function is 
determined by the angular function Y x»,(6,¢) and is even or odd accord- 
ing as K is even or odd. An interchange of the space coordinates of the 
two nuclei is equivalent to a change in sign of their relative position 
vector R, so that the parity determines the space symmetry of the wave 
function. We thus see that for nuclei of zero or integer spin, the spin 
function must be symmetric for even K and antisymmetric for odd K; 
for nuclei of half-odd-integer spin, the spin function must be antisym- 
metric for even K and symmetric for odd K. 

The discussion of Sec. 41 shows that for two nuclei of spin Ih each, 
the total of (27 + 1)? spin states can be divided into (J + 1)(2I + 1) 
symmetric states and J(2J + 1) antisymmetric states. Thus, in a gas 
that is in statistical equilibrium, the ratio of the number of molecules 
with even K to the number with odd K will be (J + 1)/Z if I is zero or an 
integer, and J/(I + 1) if J is half an odd integer.’ This effect gives rise 
to alternating intensities in the band (rotational) spectrum of homonu- 
clear diatomic molecules. Both the spin and the statistics of appropriate 
nuclei can be determined in this way, and the results are in agreement 
with the general statement in Sec. 41. 


5SOLIATOMIC NUCLEI 


The application of quantum mechanics to the investigation of the struc- 
ture of atomic nuclei entails great mathematical complexities in all but 
the simplest cases. In this section we mention very briefly some general 
properties of nuclei and then consider the nuclear two-body problem.’ 


GENERAL PROPERTIES OF NUCLEI 


Atomic nuclei consist of protons and neutrons, both of which are called 
nucleons. Other particles such as mesons, which have a transient exist- 
ence within nuclei, are usually ignored in structure theories. Protons 


1 This ratio is, of course, modified by the Boltzmann factor if the spacing between 
rotational levels is not small in comparison with the thermal energy «xT. 

2 For more detailed discussions at various levels, see W. E. Meyerhof, “Elements of 
Nuclear Physics’? (McGraw-Hill, New York, 1967); M. A. Preston, “Physics of the 
Nucleus” (Addison-Wesley, Reading, Mass., 1962); R. D. Evans, ‘The Atomic 
Nucleus” (McGraw-Hill, New York, 1955); A. E. S. Green, “Nuclear Physics’ 
(McGraw-Hill, New York, 1955); R. G. Sachs, ‘“‘Nuclear Theory” (Addison-Wesley, 
Reading, Mass., 1953); G. Gamow and C. L. Critchfield, ‘“Theory of Atomic Nucleus 
and Nuclear Energy Sources’? (Oxford, New York, 1949); J. M. Blatt and V. F. 
Weisskopf, ‘“‘Theoretical Nuclear Physics’ (Wiley, New York, 1952); M. G. Mayer 
and J. H. D. Jensen, “Elementary Theory of Nuclear Shell Structure’ (Wiley, New 
York, 1955); A. de-Shalit and I. Talmi, “Nuclear Shell Theory” (Academic, New 
York, 1963); E. J. Konopinski, ‘‘The Theory of Beta Radioactivity” (Oxford, London, 
1966). 
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are nuclei of hydrogen atoms, and neutrons are particles that have about 
the same mass, no electric charge, and the same spin and statistics as 
protons (spin $h, Fermi-Dirac statistics). A nucleus can be characterized 
by its charge Ze, where Z is an integer and ¢ is the positive proton charge, 
and its mass M, which is measured in units of +s of the mass of O18 
(oxygen isotope of mass number 16). M is always found to be close to 
an integer A, called the mass number. The number of neutrons in a 
nucleus is equal to A — Z; thus the deuteron (heavy-hydrogen nucleus) 
H? consists of one proton and one neutron, the alpha particle (helium 
nucleus) He? of two protons and two neutrons, and the gold nucleus Au!” 
of 79 protons and 118 neutrons. 

According to the theory of relativity, the difference between the sum 
of the masses of the Z protons and A — Z neutrons in a nucleus and the 
mass M of that nucleus, multiplied by the square of the speed of light, is 
the energy evolved when the separate nucleons are brought together to 
form the nucleus. This energy is called the binding energy of the nucleus 
and is conveniently measured in units of a million electron volts (Mev). 
The nuclear radius R is a reasonably well-defined quantity. It can be 
measured in several ways, for example, from the scattering of high- 
energy neutrons, protons, and electrons. It is found experimentally that 
both the binding energy per nucleon and the volume per nucleon are 
roughly constant over most of the periodic system. The former is about 
8 Mev, and the latter is usually expressed in the form R = r,A!, where 
ro ~ 1.2 to 1.4 X 10-8 cm. The approximate constancy of the binding 
energy and volume per nucleon is a consequence of the saturation property 
of nuclear forces. ' 


TWO-NUCLEON INTERACTION 


The most fundamental problem to be solved in connection with nuclei is 
the determination of the parameters of the interaction energy between 
pairs of nucleons. It is possible that once these are known the problem 
of the structure of nuclei heavier than the deuteron would become merely 
an exceedingly complicated exercise in the application of quantum 
mechanics. This situation would be analogous to that which obtains 
in atomic and molecular structure, where the main interaction is known 
to be given by Coulomb’s law. On the other hand, it is possible that 
knowledge of the two-body interactions is not enough to determine the 
structure of heavier nuclei, even in principle. This would be the case if 
there were additional interactions which occur only when three, four, or 
more nucleons are close together, so that their existence and character 
could not be inferred from a study of the two-nucleon system. The 
question of whether or not appreciable many-body interactions exist has 
not yet been settled, and we shall not discuss it further here. 
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We devote the remainder of this chapter to the two-nucleon system 
and assume that the main interaction is of short range. It is reasonable 
to expect this range to be substantially less than the radii of heavy nuclei, 
and calculations of the type described below show that it is of the order 
of 2 X 10-7 em. We put aside for the present any possible dependence 
of the interaction potential energy V(r) on quantities other than the 
magnitude of the separation distance r between the two nucleons. Our 
first problem then is to solve the Schrédinger equation for the relative 
motion of two particles of reduced mass yu in the potential V(r). Since 
neutrons and protons have about the same mass, u is very nearly equal 
to half the mass of either of them. 


NEUTRON-PROTON SYSTEM 


A simple assumption for the shape of V(r) is the square well form (see 
Fig. 13, page 83): V(r) = —Vo for r < a, V(r) =O forr >a. It was 
shown in Sec. 15 that there is no bound state of a particle of mass yu in this 
potential unless Voa? > 7h?/8u, which is equal to 1.0 X 10-24 Mev-cm? 
for the neutron-proton system. If we assume that a ~ 2 X 10-" cm, 
Vo must exceed 25 Mev in order for the deuteron to exist. Since the 
deuteron has only one bound state, it is reasonable to suppose that it 
corresponds to! = 0. Then the solution of Prob. 7, Chap. 4, shows that 
the measured binding energy of approximately 2.23 Mev is obtained with 
Vo =~ 36 Mev. 

The scattering cross section for neutrons of very low energy on 
protons can then be obtained from Eq. (19.28). If we neglect E in 
comparison with Vo, we find that « ~ 3.6 X 10-*4cm?. The measured 
cross section for neutrons of a few electron volts energy in hydrogen is 
about 20.4 X 10-24 em?; this energy is small enough so that it can be 
neglected in comparison with Vo and large enough so that the binding of 
the proton in the hydrogen molecule does not affect the result. An 
explanation of this discrepancy in terms of the dependence of the neutron- 
proton interaction on the spin state was suggested in 1935 by E. Wigner 
(unpublished). The deuteron is known to have spin hf and so is in a 
triplet spin state. However, as discussed in Sec. 43 in connection with 
exchange collisions of electrons with hydrogen atoms, the colliding neu- 
tron and proton will be in a triplet state in three-fourths of the collisions 
and in a singlet spin state in one-fourth of the collisions. Thus the 
discrepancy is removed if the singlet cross section is taken to be about 
70 X 10-?4 cm?. 

If we assume that a is also equal to 2 X 10-™ cm for the singlet 
interaction, Eq. (19.28) shows that this cross section is obtained for a 
potential depth of either 24 or 27 Mev. It is apparent that this is 
the resonance scattering case discussed in Sec. 19 and that these two 
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potentials correspond to virtual and bound singlet states, respectively. 
The decision as to which is correct cannot be made on the basis of the 
dependence of the scattering cross section on incident neutron energy; it 
is shown in Sec. 19 that, with 1 = 0, ¢ is a monotonic decreasing function 
of E in both cases, and there is not enough difference in the behavior of 
the two functions. It is found from other considerations! that the singlet 
state is virtual, so that the depth corresponding to this range is about 
24 Mev. 


ARBITRARY SHAPE OF POTENTIAL 


The interaction potential energy between a pair of nucleons is character- 
ized by a short range a and a large magnitude V». Here, a and Vp do not 
apply only to the square well shape but rather refer to the distance within 
which V(r) is appreciably different from zero and to the approximate 
magnitude of V(r) in this region, respectively. For collisions of moderate 
energy, up to a few Mev, ka is fairly small compared with unity, where 
k = (2uE)*/h and E is the kinetic energy in the center-of-mass coordinate 
system. For example, with a = 2 X 10-" em, ka is equal to unity when 
the energy of the incident nucleon in the laboratory system is about 20 
Mev. Thus for moderate energies, only the 1 = 0 partial wave need be 
considered. Also, both E and e, the binding energy of the deuteron, are 
fairly small in comparison with Vo. It follows that the form of the 
1 = Q radial wave function depends only slightly on the energy within 
the potential range and has its simple asymptotic form outside this 
range. This suggests that the binding and the low-energy scattering 
produced by such a potential should depend primarily on the “strength” 
of the potential, measured roughly by Voa?, and on the distance at 
which the wave function attains its asymptotic form, measured roughly 
by a. 

It turns out that any strong, short-range, predominantly attractive 
potential can actually be represented by two parameters, which may be 
chosen to be a strength and a range, and which between them specify the 
bound-state energy —e and the dependence of the scattering phase shift 
on energy for moderate values of E.2_ Thus low-energy experiments can 
be expected to determine only these two parameters and not the shape of 
the potential V(r); this expectation is confirmed by the experimental 
results. 


1 Most reliably from the scattering of very slow neutrons in ortho- and parahydrogen; 
the possibility of such a determination was first pointed out by J. Schwinger and 
EK. Teller, Phys. Rev. 52, 286 (1937). 

? Although suspected earlier, this result was first established by J. Schwinger in 
unpublished lectures (1947), using a variational method; the present treatment follows 
that of H. A. Bethe, Phys. Rev. 76, 38 (1949). ; 
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RELATIONS FOR THE PHASE SHIFT 
We work entirely with the 1 = 0 partial wave, and let u(r) be the product 
of r and the radial wave function. The normalization of u is so chosen 
that its asymptotic form is 


u(r) > ¥(r) (50.1) 
outside the range of the potential, where 
__ sin (kr + 4) 
V(r) = —— (50.2) 


for all r. The phase shift 5 agrees with definition given in Eq. (19.8), 
and the total cross section is 


— a sin? 8 (50.3) 


as in Eq. (19.13). 
The wave equations for particular energies FE, and Ez are 


2 ; 
ou + kyu, — Uu, = 0 

50.4 
Be ate (50.4) 
ye t hePue — Uue = 0 


where U(r) = 2uV(r)/h?. We multiply the first of Eqs. (50.4) by us, 
the second by w, and integrate the difference between them over r from 
r = 0 to a distance R& that is somewhat larger than the range of the 
potential: 


te et — y, 
?“dr dr 
The y’s satisfy the same Eqs. (50.4) as the u’s, except that the U terms 
are absent. Therefore Eq. (50.5) holds for the y’s as well: 


dry ds 
> “dr dr 
If now Eq. (50.5) is subtracted from Eq. (50.6), the contributions to the 


left side from the upper limit R cancel, since u(R) = ¥(R). Then the 
limit 2 — © can be taken, to yield 


dy. dy dus dui 
(v.% ia ey = (ue =e me) 


: = (Is? — fy?) f° (Vive — usu) dr (50.7) 


R 
= (ky? — ky’) i Uiu2 ar (50.5) 
0 0 


: = (ka? — ky?) [" yada dr (50.6) 
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From Eq. (50.2), the first parenthesis on the left side is equal to kz cot 2 — 
k, cot 6;. Also, since u(0) = 0, the second parenthesis on the left side 
is zero. Thus Eq. (50.7) can be written 


ke cot 62 — ky cot 61 = $(ke? — ky?) p(Ej,E2) 


-) 50.8 
p(E,E 2) 2 I, (ive = U1U2) dr ( ) 


A modification of Eqs. (50.8) that is of interest is obtained by 
replacing E; by —e, ¥i(r) by ¥,(r) = e-*", where 6? = Que/h?, and u(r) 
by the ground-state wave function u,(r) normalized in analogy with 
(50.1). The result is 


ke cot 62 + B = 3(k2? + B?)p(—e,E2) 


° (50.9) 
p(—¢,E2) = 2 I (Wo2 — Upte) dr 
Another modification consists in allowing EZ; to become zero: 

1 1 

ke cot d2 + a, = 3 ko?p(0,F2) 

p(0,E2) = 2 A (Powe — wove) dr (50.10) 
ex r | eC 

Ww =1 2 Ti lim (% eot 6) 


where the subscripts on uo and po denote zero energy. The quantity a, 
is the scattering length defined below Eq. (39.39); from Eq. (50.3) the 
scattering cross section at zero energy is equal to 4za,2.. The subscript on 
a, implies that it refers to the triplet, not the singlet, interaction. 


EFFECTIVE RANGE 


Equations (50.8) to (50.10) are exact. We now make an approximation 
with regard to p that is suggested by the general form of the potential. 
It is apparent that Eq. (50.1) makes the integrand of p vanish outside 
the potential. Inside the potential, all the y’s are very nearly equal to 
unity since kr and @r are small in comparison with unity; also, the u’s 
are very nearly equal to each other since U is much larger than k? or 8?. 
Thus p depends only slightly on its arguments and can be calculated for 
any convenient energies. Our approximation, then, is to replace p in 
the above equations by 


r, = p(0,0) = 2 L (Yo? — uo?) dr (50.11) 


which is the effective range introduced in Sec. 39; again, the subscript on 
r implies that it refers to the triplet interaction. Alternatively, the 
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effective range could be defined, for example, as 
pl—e—6) = 2 f° Wo? — we?) ar (50.12) 


It is shown in Prob. 17 that the effective ranges computed from Eqs. 
(50.11) and (50.12) in a typical case agree with each other within a few 
percent. 

With this approximation, the scattering phase shift is given by Eq. 
(50.9) as (dropping the subscript 2) 


keot 6 +6 = 4rtk? + 6) (50.13) 
and by Eq. (50.10) as’ 


k cot 6 + + = irk? (50.14) 
t 


Equations (50.13) and (50.14) are the same as Eqs. (39.40) and (39.39), 
respectively, when 8 is identified with x and use is made of (39.41). 
Comparison of (50.13) and (50.14) gives the following relation between 
a,, 8, and r;: 


1 Pa 
5, w 8 ee (50.15) 


which is the same as (39.42). 

Hither of the quantities 8 or a, may be thought of as a strength 
parameter for the potential and r; as a range parameter; however, 1/8 
and a, differ enough so that specification of any two of the three fixes the 
third. Thus, according to this shape-independent effective-range theory, 
all the binding and low-energy scattering properties of the potential are 
determined by just two parameters for each spin state. The experimental 
results show that this is actually the case and so confirm the assumption 
of a strong, short-range interaction on which the theory is based. 

Experimental values are 1/8 = 4.32 X 10-13 em for the triplet spin 
state (e« = 2.225 Mev), a, = 5.40 X 10-™ em, and 7, = 1.72 X 10-% em; 
these are consistent with Eq. (50.15). The corresponding square well — 
interaction has a depth of 35.5 Mev and a radius of 2.03 X 10-%em. For 
the singlet spin state, a, = —23.7 X 10-13 cm, where the negative sign 
means that there is no bound state; the effective range is not very 
well determined experimentally but probably lies between 2.5 and 
2.7 * 10-3 cm.'!' A square well of depth 16.8 Mev and radius 
2.37 X 10~!3 cm corresponds to the above a, and r, = 2.47 X 10-!3em; 
the depth 14.2 Mev and radius 2.57 X 10- em corresponds to 
r, = 2.68 X 1071? cm.? 


1H. P. Noyes, Nucl. Phys. 74, 508 (1965). — 
? The square-well parameters were kindly supplied by B. F. Gibson and A. Goldberg. 
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EXCHANGE OPERATORS 


The spin dependence of the neutron-proton interaction noted above can 
be expressed in terms of the spin-exchange operator $(1 + én + ép), where 
éy and ép are the Pauli spin matrices for the neutron and proton, respec- 
tively, defined as in Eqs. (27.30) or (41.3). As shown in Prob. 18, a 
triplet spin function is an eigenfunction of this operator with eigenvalue 
+1; similarly, a singlet spin function is an eigenfunction with eigenvalue 
—1. Equations (41.6) show that a triplet function of two spin } particles 
is symmetric in an interchange of the spins and hence has eigenvalue +1 
for an exchange operator; similarly, a singlet function is antisymmetric 
and hence has eigenvalue —1 for an exchange operator. Thus the above 
operator actually does have the effect of interchanging neutron and proton 
spins. . 

A space-exchange operator! multiplies wave functions of even / by + 
and wave functions of odd 1 by —1; it has no effect on the results obtained 
thus far, which are all for! = 0. For higher energy scattering, the discus- 
sion of Sec. 19 shows that the partial wave with / = 1 can be significant. 
If the phase shift 5; for 1 = 1 is small in magnitude and the higher phase 
shifts can be neglected, Eq. (19.32) can be approximated as 


o(0) = a (sin? 5) + 361 sin 280 cos 6) 


For energies high enough so that 6; is appreciable, the / = 0 phase shift 
50 is likely to be between 0 and 90°, so that the sign of the angular asym- 
metry is determined by the sign of 43. 

If the interaction were predominantly of the non-space-exchange 
type, the potential would be negative (attractive) for 1 = 1, and 81 
would be positive. Then neutrons incident on protons would be pref- 
erentially scattered forward in both the center-of-mass and laboratory 
coordinate systems. If, on the other hand, the space-exchange operator 
were to dominate the interaction, the potential energy would be repulsive 
for 1 = 1, and 5, would be negative. Then the neutrons would be pref- 
erentially scattered backward in the center-of-mass system or at right 
angles in the laboratory system, and the protons would tend to recoil in 
the forward direction in both systems. This effect can be regarded 
physically as a non-space-exchange type scattering accompanied by an 
exchange of identity between the neutron and the proton. 

Moderate-energy scattering experiments show that the differential 
cross section is nearly symmetric about 90° in the center-of-mass system. 
If it were exactly symmetric, the simplest and most likely situation 


1 This was introduced by E. Majorana, Z. Physik 82, 137 (1933), as a modification 
of the original suggestion of W. Heisenberg, Z. Physik 77, 1 (1932). 
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would be either that all the even | phase shifts are zero or that all the 
odd / phase shifts are zero. The first case is impossible, since it is known 
that 55 #0. Weare thus led to the Serber interaction as a rough approxi- 
mation to the space-exchange character of the neutron-proton potential 
at moderate energies: The coefficients of the space-exchange and non- 
space-exchange parts of the neutron-proton potential are equal and of 
the same sign, so that there is no interaction in odd 1 states. 


PROTON-PROTON SCATTERING 


The scattering of fast protons in hydrogen can be treated by the methods 
outlined in Sec. 21, with suitable allowance for the identity and spin of 
the two colliding protons (see Sec. 41). When the interaction is a pure 
coulomb field, combination of Eqs. (21.10) and (41.2) yields the Mott 
scattering formula! 


2 2 
o(@) = (5) {cosee 40 + sect 40 
2 
— cosec? 4@ sec? 36 cos E In (tan? || (50.16) 


which is expressed in the center-of-mass coordinate system. This for- 
mula represents only the experiments for protons of less than about 0.2 
Mev bombarding energy, since for higher energies the protons approach 
closely enough so that the specifically nuclear interaction becomes 
appreciable. Up to several Mev energy, only the 69 term in (21.27) need 
be included. It must be remembered that, because of the exclusion 
principle, the partial wave with / = 0 is associated with a singlet spin 
state. Thus experiments with fast protons are required if information 
concerning the triplet proton-proton interaction is to be obtained. An 
effective-range theory can be made for the singlet proton-proton inter- 
action and leads to roughly the same parameters as for the singlet 
neutron-proton interaction. 


PROBLEMS 


1. Find an expression for the electron density n(r) in the Thomas-Fermi model in 
terms of the dimensionless function x, and show that the radius of a sphere that 
encloses a fixed fraction of all the electrons is proportional ta Z-}. 


1N. F. Mott, Proc. Roy. Soc. (London) A126, 259 (1930). 4 is half the proton mass, 
and v is the relative speed. 

? The nucleon-nucleon interactions for a wide range of energy are reviewed by A. E. S. 
Green, M. H. MacGregor, and R. Wilson, Rev. Mod. Phys. 39, 498 (1967), and other 
articles in the same issue of that journal. 
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2. Use Lagrange’s method of undetermined multipliers to show that the condition 
that the integral in (47.9) is stationary, when the w’s are varied but kept normalized, 
is given by (47.11). 

3. Two p electrons (J = 1) can have L = 0, 1, or 2 and S = 0 or 1 in the Russell- 
Saunders case. Are all combinations of Z and S permitted if the n values of the two 
electrons are different? Are they all permitted if the n values are the same? 


4. Show that spin wave functions in the Russell-Saunders case that are eigenfunc- 
tions of S? with different values of the total-spin quantum number S are orthogonal. 
Use a method like that employed in Sec. 10 to show that energy eigenfunctions 
corresponding to different eigenvalues are orthogonal. 


5. Show by direct computation that J = L +S commutes with L-S and hence 
with the hamiltonian (48.1). 


6. Estimate the ratio of the term in (48.9) that is linear in H to the kinetic-energy 
term for a hydrogen atom in a magnetic field of 105 gauss. 


7. Show that the Landé g factor for a one-electron state, analogous to (48.12) but 
applicable for j = 1 + 3, is 


_@a+iti 
~ +1 


Make use of the extension of the Clebsch-Gordan coefficients (28.13) to the case 
ji = 1, je = }. 

8 Estimate the magnitude of the magnetic field strength for which the two perturba- 
tion terms in (48.11) are equal for an alkali atom. 

9. Construct a diagram that shows the relative displacements and intensities of 
the + and « Zeeman components of the *P — 2S transitions in an alkali atom when the 
magnetic field is weak. Construct a similar diagram when the field is strong. 


10. Show that the ratio of the distance traveled by the nuclei of a molecule during 
a period of the electronic motion to the dimensions of the molecule is of order (m/M)? 
in the case of vibrational motion, and (m/M) in the case of rotational motion. Do 
these results justify an adiabatic type of approximation? 


11. Does the internuclear potential (49.14) obtained for the hydrogen molecule 
approach the —1/R* form obtained in Sec. 32 for large R? If not, why not? 


12. Set up and solve the wave equation for a rigid rotator that has no kinetic energy 
of rotation about a particular axis and has equal moments of inertia about the two 
perpendicular axes. 

13. Derive the selection rules for transitions between rotational levels of a diatomic 
molecule. 

14. Discuss the selection rules for vibrational transitions when the two nuclei of a 
diatomic molecule are the same and when they are different. 


15. Calculate the contributions to the vibrational energy of a diatomic molecule of 
the neglected fifth- and sixth-power terms in the expansion (49.18), and show that their 
neglect is justified in arriving at the (v + 4)? energy given in (49.21). Use matrix 
methods to get the needed matrix elements of z5 and x¢ for a harmonic oscillator. 

16. Suppose that the interaction between a neutron and a proton is the same in 
the singlet as in the triplet state and is represented by a square well. Is there any 
value of a that will fit both the deuteron binding energy (J = 0) and the slow-neutron 
scattering cross section? If so, what is it? 
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17. Use the value of the binding energy of the deuteron to calculate the triplet 
effective range of a square well potential from Eq. (50.11) and from Eq. (50.12), 
and compare them with each other. Assume that a = 2 K 1073 em. 


18. Show that the spin-exchange operator 4(1 + éy : ép) has the properties ascribed 
to it in the text. Then make use of Eqs. (23.38) to construct triplet and singlet pro- 
jection operators. Finally, if the triplet potential energy is V,(r) and the singlet is 
V(r), construct a single spin-dependent potential energy that describes the neutron- 
proton interaction. 


19. Make use of the work of Sec. 21 to obtain an expression for the ratio of the proton- 


proton scattering with a phase shift 6) (and no others) to the Mott scattering given 
by Eq. (50.16). 


13 
Relativistic Wave Equations 


In this chapter we extend the nonrelativistic Schrédinger wave equation 
to the description of the motion of a particle that has a speed approaching 
that of light. This extension can be made in many ways, each of which 
is consistent with the Lorentz transformation equations of the special 
theory of relativity.!. A characteristic feature of relativistic wave equa- 
tions is that the spin of the particle is built into the theory from the 
beginning and cannot be added afterward, as Pauli added the electron 
spin to Schrédinger’s nonrelativistic equation. This feature provides a 
useful gauge of the applicability of a particular equation to the description 
of a particular kind of particle. Two relativistic equations are con- 
sidered here: the spin 0 equation due to Schrédinger that has since been 
found to describe a + meson, and the spin $ equation due to Dirac that 


1 For a review of special relativity, see, for example, H. M. Schwartz, “Introduction to 
Special Relativity,’’ chaps. 3, 4, 8 (McGraw-Hill, New York, 1968); P. G. Bergmann, 
‘Introduction to the Theory of Relativity,’’ pt. I (Prentice-Hall, Englewood Cliffs, 
N.J., 1946); C. Mgller, “The Theory of Relativity,’’ chaps. I-III (Oxford, London, 
1952). 
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describes an electron. In discussing these equations, we devote our 
attention mainly to the deductions that can be made from them and 
do not attempt to establish their Lorentz invariance. We shall, there- 
fore, continue to use three-dimensional vector notation rather than the 
more elegant four-dimensional notation of special relativity theory. 
The invariance of an equation can usually be inferred quite convincingly 
from its symmetry between the space coordinates and the time. 


51 ISCHRODINGER’S RELATIVISTIC EQUATION 


At the time when Schrédinger developed his nonrelativistic wave equa- 
tion, he also proposed an extension of it that meets the requirements of 
special relativity.1 This equation follows quite naturally from the transi- 
tion in classical dynamics from the nonrelativistic relation 


2 
=~ a (51.1) 


between the energy and momentum of a free particle to the corresponding 
relativistic relation 


E? = c*p? + m?c4 (51.2) 
where now E includes the rest-mass energy mc*. We proceed by adopting 
the substitutions (6.13) for E and p 

Eine p— —ikV (51.3) 


FREE PARTICLE 


A relativistic wave equation for a free particle can be obtained by sub- 
stituting (51.3) into (51.2) and operating on a wave function (r,t), just 
as the substitution of (51.3) into (51.1) yields Eq. (6.11). The result is 


—fh2 ay = —WVeVy + m2 4p 
Ot = c (51.4) 


Equation (51.4) has plane wave solutions of the form 
exp 7(k-r — wi) (51.5) 


which are eigenfunctions of the operators E and p in (51.3) with eigen- 
values fw and hk, respectively. It is apparent that (51.5) satisfies Eq. 
(51.4) if 


ho = +(h2c%k? + mc) (51.6) 
1K. Schrédinger, Ann. Physik 81, 109 (1926), Sec. 6, 
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The positive and negative square roots in (51.6) correspond to an ambigu- 
ity in the sign of the energy that also results from the classical expression 
(51.2). We take only the positive square root for the present and return 
to the negative-energy solutions at the end of Sec. 53. 
Expressions for the charge and current densities can be found in 
analogy with those obtained in Sec. 7. The conservation equation 


SPU) + V-+S(rt) =0 | (51.7) 


turns out to be invariant with respect to Lorentz transformations. We 
multiply (51.4) on the left by y*, the complex conjugate equation on the 
left by y, and subtract one from the other. Then (51.7) results, if we 
define real quantities 


_ ih (yah av" 
P(t,t) = Qmc? (v at at 


k (51.8) 
S(r,t) = aim (y*Vy — yVy¥*) 


This expression for S is identical with the nonrelativistic form (7.3), 
and the expression for P can be shown to reduce to (7.1) in the non- 
relativistic limit (see Prob. 1). It should be noted that P given by (51.8) 
is not necessarily positive and hence cannot be interpreted as a position 
probability density. It can, however, be multiplied by e and interpreted 
as an electric charge density, since charge density can have either sign so 
long as it is real. 


ELECTROMAGNETIC POTENTIALS 


We can include the electromagnetic potentials A(r,t), ¢(r,/) in the wave 
equation by making use of the fact that ¢ and (1/c)A have the same 
Lorentz-transformation properties as E and p. In analogy with the non- 
relativistic expression (24.29), we replace (51.2) by 


(E — ed)? = (cp — eA)? + mc (51.9) 
for a particle of charge e. The substitutions (51.3) then give 
0? : 6] . 2 OD 
feos oy ee i ey kee nS 242 
( h ap eho =, teh ft oly 
= [—he?V2 + QehicA-V + iefic(V- A) + e?A2 4+ m’c4]y (51.10) 


We can now find the connection between Eq. (51.10) and the similar 
Eq. (24.39) in the nonrelativistic limit. We make the substitution 


v(r,t) = y/(r,t)e—mertln (51.11) 
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in (51.10) and assume that operation with ih(0/dt) on y’ gives a result 
that is of the same order as egy’ and small in comparison with mer’. 
This is equivalent to subtracting out the rest energy and assuming that 
the remaining energies are small in comparison with it. Differentiation 
of (51.11) gives 


M4 = (U. _ imty) pimnn 


at \ at h 
oy _ (a! _ Dime? ay’ _ mtct J snern 
oe CO «#? hat h? 


The first term in each of these derivatives can be neglected, as can the 
last two terms on the left side of (51.10), which then becomes 


(zines ~ + micty’ — 2eme*oy') ens 


With these approximations, Eq. (51.10) becomes the same as (24.39) if 
y’ is replaced by y. 

There is no way in which the Pauli spin matrices (41.3) can be 
included in Eq. (51.10) without destroying the invariance of the theory. 
This is not surprising, since the spin matrices transform like the compo- 
nents of a three-dimensional, rather than a four-dimensional, vector and 
since ¥ has one component rather than two components like the spin 
functions (33.4). Thus the Schrédinger relativistic equation represents 
a particle that has no spin. 

The structure of Eq. (51.9) shows that a ‘““potential-energy”’ term 
cannot be added arbitrarily to (51.10), as the term Vy was added to 
(24.39) to give Eq. (44.1). The Lorentz transformation properties of 
‘any such term must be investigated first. If it transforms like part of a 
four-dimensional vector, the rest of this vector must be included in some 
such manner as ¢ and (1/c)A were included in (51.9). If it is an invariant 
with respect to Lorentz transformations, it can be included as part of the 
rest energy mc’. 


SEPARATION OF THE EQUATION 


The wave equation (51.10) can be separated with respect to r and ¢ if the 
potentials A, ¢ are independent of the time. We then put 


V(r,t) = u(r)e see 
and substitute into (51.10) to obtain 
(E — e¢)*u = [—hA%c?V? + QiehcA- V 
+ tehe(V > A) + e?A2 + mc4Ju (51.12) 
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We now specialize to the case in which A = 0 and ¢(r) is spherically 
symmetric. Equation (51.12) then becomes 


(—h2ce?V2 + mic)u(r) = [E — ed(r)]*u(r) (51.13) 
which can be separated in spherical coordinates (see Sec. 14) to give 


u(r,6,6) = R(r) Yim(8,) 
|- 2a ("F + eve - (E — ed)? — mict p 


dr r? hic? 
1=0,1,2,... (51.14) 


This reduces to the nonrelativistic radial equation if we put EH = mc? + E’ 
and assume that EL’ and e@ can be neglected in comparison with mc’. 
Then the fraction on the right side of (51.14) becomes (2m/h?)(E’ — e@), 
as it should. 


ENERGY LEVELS IN A COULOMB FIELD 


An exact solution of (51.14) can easily be obtained if we put ed = —Ze?/r, 
by making use of the results of Sec. 16. This situation would represent a 
hydrogen atom were it not for the fact that the particle described by 
(51.10) has no spin-and so cannot be an electron. 

If we put p = ar, Eq. (51.14) can be written as 

— 2 
bb (eM) «B= 1- eR" ]e-g 
(51.15) 

_ 4(m?ct — E?) — 2Ey 
a hic? ~ hea 


This has precisely the form of Eq. (16.7) except that l(J + 1) has been 
replaced by /(1 + 1) — y?.. The parameter \ is determined by the bound- 
ary condition on R when p = ~, and E is expressed in terms of \ by 
eliminating a from the last two of Eqs. (51.15): 


2 oa 
E=me?{1+ 2 (51.16) 
A study of the way in which Eq. (16.7) was solved shows that 
solutions of (51.15) that are finite at p = 0 and o exist only if 
=n +s+1 (51.17) 


where n’ is zero or a positive integer and s is the nonnegative solution of 
the equation 


sis +1) = 10+ 1) - 7’ (51.18) 
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Equation (51.18) has the two solutions 
s = —} + ${(21 + 1)? — 4,7}? (51.19) 


of which one is positive and the other negative for / > 0. 

For | = 0, both the s values given by (51.19) are negative, so that 
R(r), which behaves like r* nearr = 0, is singular at the origin. However, 
y is quite small (very nearly equal to Z/137 if e is the electronic charge), 
so that use of the upper sign in (51.19) gives a value of s that is close to 
zero for physically interesting values of Z, whereas the lower sign makes s 
close to —1. The choice between these two values of s can be made in 
the following way. In a physically realizable case, the source of the 
coulomb field has some small but finite size a that corresponds, say, to the 
nuclear radius, so that ¢(r) is finite everywhere. Then the solution R(r) 
that is finite at r = 0 can be shown to approach the point coulomb solu- 
tion that corresponds to the upper sign in (51.19) when a — 0 (see Prob. 
3). We then use the upper sign in (51.19) for all / and obtain from (51.17) 


h=n'+3+ (+3)? — v7} (51.20) 


Equations (51.16) and (51.20) give a fine structure to the non- 
relativistic energy levels (16.15). This can be seen by expanding the 
expression for the energy levels in powers of y?. The result to terms of 
order y‘ is 


? vy‘ n 3 
E = met | - %- &(-3| onan 


where n = n’ + 1+ 1 is the total quantum number of Eq. (16.14) and 
can take on positive integer values. The first term on the right side of 
(51.21) is the rest energy. The second term is 


and agrees with (16.15). The third term is the fine-structure energy, 
which removes the degeneracy between states of the same n and different 
l. The total spread of the fine-structure levels for a given n is easily 
seen from (51.21) to be 

me*y4n — 1 


ag (51.22) 


This is much larger than is observed experimentally in the spectrum of 
hydrogen. 
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52LIDIRAC’S RELATIVISTIC EQUATION 


Dirac! approached the problem of finding a relativistic wave equation by 
starting from the hamiltonian form (24.1) 


ih s v(r,t) = Hy(rt) (52.1) 


The classical relativistic hamiltonian for a free particle is the positive 
square root of the right side of Eq. (51.2). However, if this is substituted 
into (52.1) and p is replaced by —7hV, the resulting wave equation is 
unsymmetrical with respect to space and time derivatives and hence not 
relativistic. Dirac therefore modified the hamiltonian in such a way as 
to make it linear in the space derivatives. 


FREE-PARTICLE EQUATION 


The simplest hamiltonian that is linear in the momentum and mass term is 
H = ca-p + Bme? (52.2) 
Substitution into (52.1) leads to the wave equation 
(HE — ca: p — Bmc?)y = 0 
or . (52.3) 


(ih 3, + ite av — pmet) y = 0 


We now consider the four quantities az, ay, a, and B. If (52.3) is to 
describe a free particle, there can be no terms in the hamiltonian that 
depend on the space coordinates or the time, since such terms would have 
the properties of space-time-dependent energies and give rise to forces. 
Also, the space and time derivatives are to appear only in p and £, and 
not in a and 8, since (52.3) is to be linear in all these derivatives. We 
thus conclude that @ and 6 are independent of r, ¢, p, and EH and hence 
commute with all of them. This does not necessarily mean that « and 8 
are numbers, since they need not commute with each other. 

We can learn more about a and 8 by requiring that any solution y of 
(52.3) also be a solution of Schrédinger’s relativistic equation (51.4) (the 
converse need not be true). This is a reasonable requirement since, in 
the absence of external fields, the wave-packet solutions of (52.3) whose 


1P, A. M. Dirac, Proc. Roy. Soc. (London) A117, 610 (1928) ; ‘“The Principles of Quan- 
tum Mechanics,” 4th ed., chap. XI (Oxford, New York, 1958). See also J. D. 
Bjorken and S. D. Drell, ‘‘Relativistic Quantum Mechanics” (McGraw-Hill, New 
York, 1964); H. A. Bethe and E. E. Salpeter, “‘Quantum Mechanics of One- and Two- 
electron Atoms,’”’ chap. I(b) (Springer, Berlin, 1957). 
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motions resemble those of classical particles must have the classical 
relation (51.2) between energy, momentum, and mass. We therefore 
multiply the first of Eqs. (52.3) on the left by (H + ca+ p + 8mc’) to 
obtain 


{ E? = Cfaz*pz? + ay*py* + a,*p," + (Ardy + Ayr) Dz Dy 
+ (ayaz + Q2Qy) Dy Dz + (A202 + 202) PzPz] =a, m*c4p? 
—mc*[(az8 + Baz)pe + (a8 + Bay) py + (a,8 + Baz)pJ}y = 0 
(52.4) 


where the substitutions (51.3) for E and p in terms of differential opera- 
tors are implied. Equation (52.4) agrees with (51.4) if a, 8 satisfy the 
relations 


a2 =a, =a =p? =1 
Arty + Ayo, = aya, + ay = aa, + asa, = 0 (52.5) 
az8 + Baz = ay + Bay = a8 + Ba, = 0 


The four quantities are said to anticommute in pairs, and their squares 
are unity. 

Since a, 8 anticommute rather than commute with each other, they 
cannot be numbers. We have seen in Chap. 6 that quantities of this type 
can be expressed in terms of matrices, and it is convenient for calculation 
to have a matrix representation of them. We note first that, since the H 
given by (52.2) is hermitian, each of the four matrices a, 6 must be 
hermitian and hence square. Our problem is to find an explicit repre- 
sentation in which, say, one of these matrices is diagonal (then the others 
cannot be diagonal since they do not commute with this one). In the 
interests of simplicity, we shall require the representation to have as low a 
rank as possible. . 


MATRICES FOR a AND 8 


The squares of all four matrices are unity, so that their eigenvalues are +1 
and —1. We arbitrarily choose 8 as the matrix that is to be diagonal, 
and we rearrange its rows and columns so that all the +1 eigenvalues are 
grouped together in a matrix of rank n, and all the —1 eigenvalues are 
grouped in a matrix of rank m. Since 8 anticommutes with e, it cannot 
be a constant matrix, and so both n and m must be greater than zero. 
‘The 6 matrix can be represented schematically as 


B= E a (52.6) 
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which is an abbreviation for 


1 0 {0 0 
0 1 $0 0 
Oe ge a oe oan) 
0 0 10 -1 


The dashed lines in (52.7) separate the submatrices 1, 0, 0, and —1 that 
appear in (52.6).!_ We now consider the matrix equation a,8 + Ba, = 0, 
the jl element of which is 


(az) i(8; + Bi) = 0 


Here 8; and §; are two of the eigenvalues of 8, which are arranged in 
accordance with (52.6) or (52.7). If 8; = 6), then (az); = 0, whereas if 
8; and 6; have opposite signs, (a): need not be zero. Thus the matrix for 
a, can be written in the form 


a k a (52.8) 


Qr2 0 


where az has n rows and m columns, and az2 has m rows and n columns. 
Since the square of (52.8) is a unit matrix, we see also that 


Gr1te2 = 1 O22021 = 1 (52.9) 


The unit matrix that appears on the right side of the first of Eqs. (52.9) 
has n rows and columns, and the unit matrix in the second equation has 
m rows and columns. It is not difficult to show that no matrices can be 
found that satisfy (52.9) if n,m equal 1,2 or2,1. We therefore consider 
the two possibilities n = m = 1 and n = m = 2 as giving matrices of 
minimum rank.? It is apparent that a, and a, can be written in forms 
similar to (52.8). 

We have already obtained three anticommuting matrices of the form 
(52.6) or (52.8) with n = m= 1. These are the Pauli spin matrices 
(41.3) 


= bi a je E | os E e (52.10) 


1 The matrices 1, 0 are the same as the unit and null matrices 1, O defined in Sec. 22. 
* Higher-rank matrices correspond to particles with spin greater than }. 
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which satisfy the equations 
OxrTy = —~COyCz = 102 (52.11) 


together with the two similar relations obtained by permuting 2, y, z. 
Any matrix with two rows and columns has four elements and so can be 
expressed as a linear combination of the four linearly independent 
matrices oz, oy, ¢;, and 1. Then it is not difficult to show that a fourth 
matrix that anticommutes with all three of those in (52.10) cannot be 
found. 

We therefore try n = m = 2 and for simplicity take a2: = az2, etc. 
Then Eqs. (52.9) become az? = 1, and the equation a,a, + aya, = 0 
becomes aziay1 + ayi@zi = 0. From these and the similar relations 
obtained by permuting zx, y, 2, we see at once that we can identify az 
with o,, etc. We thus arrive at an explicit matrix representation for 6, a: 


1 0 0 0 0 0 oO 1 
10 1 0 0 _|0 oO 1 0 
B=lyo 9 -1 oO| “lo 1 +0 =O 
0 oO oO-1 1 0 oO 0 
: (52.12) 

0 0 oO -: 0 0 1 0 
[0 oOo ¢ 0 _10 0 oO -1 
“lo -i O 0 ~“11 0 0 O 
i.0. 0 0 0-1 O oO 


These matrices are evidently hermitian; we abbreviate them as 


a F a) a . 3 (52.13) 


where each “element” is a matrix with two rows and columns.! 


FREE-PARTICLE SOLUTIONS 


Now that a and 8 are represented by matrices, Eq. (52.3) has no meaning 
unless the wave function y is itself a matrix with four rows and one column: 


¥i(r,£) 
V2 (r,t) 
Va(r,t) 
(r,t) 


1 The relativistic character of the Dirac equation can be made more apparent if (52.3) 
is multiplied through on the left by 8; this enhances the symmetry between space 
and time derivatives, since the four matrices 8, Ba have properties similar to the 
four matrices 8, «. 


v(r,t) = (52.14) 
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Then (52.3) is equivalent to four simultaneous first-order partial differ- 
ential equations that are linear and homogeneous in the four y’s. 
Plane wave solutions of the form 


y; (r,t) = Uj exp i(k So as wt) 


can now be found, 


where the u; are numbers. 


j = 1, 2,3, 4 


(52.15) 


These are eigenfunctions 


of the energy and momentum operators (51.3) with eigenvalues hw and 
ak, respectively. Substitution of (52.15) and (52.12) into (52.13) gives 
a set of algebraic equations for the u;, where E = hw and p = hk are now 


numbers: 
(BE — me?)u, 
(FE — mc?)us 
(E + me?)us 
(EZ + me*)us 


Cpls — C(pz — tpy)us = O 
C(pze + ipy)us + cpus = 0 
— cp — C(pz — tpy)u. = 0 
C(pz + tpy)ui + epsu2 = 0 


I 


(52.16) 


These equations are homogeneous in the wu; and hence have solutions only 


if the determinant of the coefficients is zero. 


This determinant is (EH? — 


m*c* — ¢*p)?, so that the relation between E and p is in agreement with 


(51.2). 


Explicit solutions can be obtained for any momentum p by choosing 


a sign for the energy, say FE, 


+ (cp? ate mct)}, 


Then there are two 


linearly independent solutions, which are conveniently written as 


U1 


Similarly, if we choose the negative square root E_ 


= - CDs 
aa “By + me 
ug = (i= + tv) 
Ey. + me? 
’ oe . (52.17) 
os = C\pz — tpy 
U2 = 1 U3 Ey + me? 
pares Cpz 
RE Boag 


— (c2p? + m2ct)}, 


we obtain two new solutions, which are conveniently written as 


m= Py, Pe + ips) 
1 E_ = me a E_ — mc? 
U3 = 
= C(Pz ree tpy) fae? Sn, Cpz 
A ee Be aet 
U3 = 


(52.18) 


1 


U4 


Each of these four solutions can be normalized, in the sense that yy = 1, 
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by multiplying it by the factor {1+ [c?p?/(Z, + mc?)*]}—-4; Yt is the 
hermitian adjoint of y and is a matrix with one row and four columns. 

It is apparent that the solutions (52.17) correspond to positive 
energy and the solutions (52.18) to negative energy. In the nonrelativ- 
istic limit, in which E, = —E_ is close to mc? and large in comparison 
with clp|, ws and us are of the order of v/c times u1 or ue for the positive- 
energy solutions (v is the speed of the particle) ; the opposite is true for the 
negative-energy solutions. The physical distinction between the two 
solutions for each sign of the energy can be seen by defining three new 
spin matrices o,, o,, ¢, that have four rows and columns: 


,_[s 0 | 
é =I§ | (52.19) 


We shall see at the beginning of Sec. 53 that 3/0’ can be interpreted as the 
operator that represents spin angular momentum. When the small 
components of the wave function can be neglected, it is easy to see that p 
is an eigenfunction of o, with eigenvalue +1 for the first of each pair of 
solutions (52.17) and (52.18), and eigenvalue —1 for the second solution 
of each pair. 


CHARGE AND CURRENT DENSITIES 


We can obtain a conservation equation by multiplying the second of Eqs. 
(52.3) on the left by y', the hermitian adjoint equation 


—th ov — the(Vyt) «a — ytBme? = 0 


on the right by y, and taking the difference of the two results. We then 
get Eq. (51.7) if we define the real quantities 


Port) = yy S(r,t) = oploy (52.20) 


The expression for P has the nonrelativistic form (7.1); since it is never 
negative, it can be interpreted as a position probability density. It can 
be shown that the expression (52.20) for S reduces to (7.3) in the non- 
relativistic limit (see Prob. 6). 

The second of Eqs. (52.20) suggests that the operator ca can be 
interpreted as a particle velocity. This can be verified by calculating 
the time derivative of the position vector r from Eq. (24.10) in the 
Heisenberg picture. With the hamiltonian (52.2) and the commutation 
relations (24.31), we obtain 


dx 


or 4 (cH — Hx) = ca, . (52.21) 
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ELECTROMAGNETIC POTENTIALS 
Terms that involve the electromagnetic potentials can be added to Eq. 
(52.3) in a relativistic way by making the usual replacements cp > cp — 
eA and EK — E — ed, where the particle described by the equation has 
electric charge e. We thus obtain 

[EH — ed — a: (cp — eA) — Bmc’ly = 0 (52.22) 


Here E and p stand for the operators (51.3). This equation can be 
reduced to a form that is similar to (51.10) by multiplying it on the left 
with [EF — eg + a: (cp — eA) + Bmc?]. The result is 
{(E — ed)? — [a+ (cp — eA)]? — met 
— (E — e¢)a: (cp — eA) + «- (cp — eA)(E — ef) }y = 0 
(52.23) 


The second operator in (52.23) can be reduced by making use of the 
following relation: 


(a: B)(a-C) = B-C + i6’> (Bx C) (52.24) 
where B and C commute with a@ but not necessarily with each other (see 
Prob. 7). We identify both B and C with cp — eA; we also require the 
relation 

(cp — eA) x (cp — eA) = —ce(A X p+ pXA) = tehcV X A = tehcH 
where use has been made of (24.30). With this substitution, Eq. (52.24) 
becomes 

[a - (cp — eA)]? = (cp — eA)? — ehce’> H 
The last two operators in (52.23) can be written as 

ea: (EA — AE) + cea: (op — pd) 

= teha: oA + ielica> Vd = —tehca:E 


where use has again been made of (24.30). Equation (52.23) then 
becomes 


[((E — ep)? — (cp — eA)? — m’ct + ehcd’> H — tehca- Ely = 0 
(52.25) 


The first three terms are precisely the same as (51.10). The physical 
significance of the last two terms will now be shown from consideration of 
the nonrelativistic limit of the entire equation. 

We could proceed just as we did in obtaining the nonrelativistic 
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limit of Eq. (51.10). A slightly different approach consists in putting 
E = E’ + me? (52.26) 


and assuming that E’ and e¢ are small in comparison with mc?; this is 
equivalent to the substitution (51.11) and the subsequent approxima- 
tions. We can then make the replacement 


(E — ed)? — mict = 2mc?(E’ — ed) 
in (52.25) to obtain 


‘ : 

E'y = | ( ~<a) +6 ~ aH + xh a E|y (52.27) 
Now E’ is equivalent to the time derivative operator 7h(d/dt) if a factor 
e~ime#t/4 is taken out of y. Thus (52.27) is the nonrelativistic Schrédinger 
equation (24.39), with two additional terms that involve H and E directly. 

The H term has the form associated with the energy of a magnetic 
dipole of moment (eh/2mc)é’._ It was shown in (52.18), for a free particle, 
that the first and second components of the wave function are large in 
comparison with the third and fourth components for the positive-energy 
solutions in the nonrelativistic limit. This can also be shown without 
difficulty, when the particle is not free, from the structure of the general 
equation (52.22). Equation (52.19) shows that 6’ operating on the four- 
component wave function is the same as é operating on the large compo- 
nents alone. Thus the two large components of (52.27) with the H term 
give just the nonrelativistic equation with the Pauli spin matrices and the 
correct coefficient for the magnetic moment of an electron [see Eq. (48.10), 
in which the electron charge is —e]. 

We now show that in practical cases the E term in (52.27) is of 
order (v/c)? times the ed term and so is to be neglected in the nonrela- 
tivistic limit.!. We note first that the expectation value of « is of order 
(v/c)fyty dr, since (52.13) shows that @ mixes the large and small com- 
ponents (we also saw earlier that ca is the velocity operator). For an 
electron that is part of a system of linear dimensions a, e¢ is of order 
eEa, and A/a~ p~mv. Thus the ratio of the E to the e@ terms in 
(52.27) is of order 


ehvE 1 y? 
BRP yi 
mc? eka c? 


In contrast with this, we have already seen in Eqs. (48.9) and (48.10) that 
the H term in (52.27) is of the same order as the other magnetic terms 
that are linear in A. Although the E term must be omitted from (52.27) 


1 For a spherically symmetric electrostatic potential it leads to the spin-orbit energy, 
which is actually of order (v/c)*e¢; see Eq. (53.8). 
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for a consistent nonrelativistic approximation, it cannot be dropped from 
the relativistic equation (52.25), where it is required to preserve Lorentz 
invariance. 


53L]DIRAC’S EQUATION FOR A CENTRAL FIELD 


In the preceding section, the existence of the magnetic moment of an 
electron was demonstrated explicitly by showing that the expected extra 
magnetic energy appears in the nonrelativistic approximation. The 
electron spin carries no energy in itself and so can be observed only 
through its coupling with the orbital motion of the electron. In the 
first part of this section, we make this coupling apparent in two ways: 
through conservation of total angular momentum and through the spin- 
orbit energy that was introduced in Sec. 47. In both cases we work with 
such potentials A, ¢ that there is no transfer of angular momentum to the 
electron; this implies that we have a central field (A = 0 and ¢ spheri- 
cally symmetric). In the latter part of this section, we separate the Dirac 
equation for a general central field and find the energy levels of the 
hydrogen atom. . 


SPIN ANGULAR MOMENTUM 
With A(r,t) = 0 and ¢(r,t) = ¢(r), Eq. (52.22) can be written as 
.. OW 
h—=H 
gp Oke (53.1) 
H =ca:p+ Bm? + V 


where V = ed. We might expect that the orbital angular momentum 
L = r X pis a constant of the motion in such a central field. In order to 
investigate this point, we calculate the time rate of change of L in the 
Heisenberg picture with the help of Eqs. (24.10) and (24.31): 

aL, 


th Ta L,H — HL, 


= ca: [(yp. — zpy)P — p(yp: — Zpr)] 
= —the(apy = QyP2) (53.2) 


since L commutes with any spherically symmetric function such as V(r). 
It is apparent that L does not commute with H and hence is not a con- 
stant of the motion. However, we expect on physical grounds that it is 
possible to define a total angular momentum that is constant in a central 
field of force. This means that we must find another operator such that 
the commutator of its x component with H is the negative of the right 
side of (53.2); the sum of this operator and L is then a constant of the 
motion and can be interpreted as the total angular momentum. 
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It is not difficult to see that the desired operator is a multiple of the 
3’ defined in (52.19). From (52.11) and (52.13), we find that o, com- 
mutes with a, and 8, although not with the other components of a: 


ry _ for O aa | pelea | ae 
Fety We 10 oe lle, 0 o, OO «a, 


_[O #]_[ 9 tm] _ 9. 
~ Lio, 0 —is, 0 


The time rate of change of d’ can now be obtained: 
dt 
It is apparent from (53.2) and (53.3) that the quantity L + $h0’ com- 


mutes with H and can therefore be taken to be the total angular momen- 
tum. We refer to the operator 


S = jhe’ (53.4) 


as the spin angular momentum of the electron. 


= 01H — Hol = 2iclapy — aypz) (53.3) 


APPROXIMATE REDUCTION; SPIN-ORBIT ENERGY 


We now wish to show that the spin-orbit energy (47.13) is a consequence 
of the Dirac equation. This term can be shown to be of order (v/c)? 
times the potential energy: 


1 1 IdV 1 iV v? 
Vimar a “9 ~ pag gi Pt~ o 


where a represents the linear dimensions of the system, and 


a PT mw 
Thus the approximations that led to (52.27) are not adequate for the 
present purpose. 

In order to obtain a consistent approximation that is expressed in 
terms of the more familiar two-component wave functions, we replace 
¥ in (53.1) by ¥1 and 2, which now represent the first two and the last 
two components of y, respectively. We assume that yi, ¥2 together con- 
stitute a nonrelativistic energy eigenfunction, which means that 


E = E’ + me? 
is regarded as a number rather than an operator; E’ and V are assumed to 
be small in comparison with mc?. The wave equation then becomes 
(E’ — Vy — c6+ py = 0 (53.5) 
(E’ + 2mc? — V)p2 — cé- py = 0 
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where p is still an operator. It is apparent from the second of these 
equations that y2 is of the order of v/c times yi, and so we eliminate it to 
obtain an equation in terms of y¥; alone. The substitution 


= (E’ + 2mc? — V)-'cé > py 
in the first of Eqs. (53.5) gives 


Ey = a (6+ p) ¢ + me) (6: p)f¥i + Via (53.6) 


Thus far, no approximations have been made. 

The desired approximation is obtained by keeping the lowest terms 
in an expansion in powers of (Z’ — V)/2mc?. The following relations 
are easily established: 


pV = Vp — 274VV 
(6-VV)(6-p) = (VV) - p+ ié:[(VV) x pl 
With the help of these, (53.6) becomes 
Ey, = Ie — “aa ) + v| v1 
— ghia (VV) «(Wus) + gery 6+ (VV) x Pua] (63.7) 


Further simplications can be made if V is spherically symmetric. 
We use the relations 


dV a 
(VV) Y= Gr ar 
r ar 


and note that E’ — V is approximately equal to p?/2m, the accuracy 
being sufficient to replace the second-order term (E’ — V)p? in (53.7) by 
p*/2m. We can then rewrite (53.7) as 


my = {Pr _ _Pt __W ada, 1 1ldVg, 
En = (= Smic? ue 4m?c? dr or 7 Qm2c? r dr Soh vs 


where now S = d#é and L = r x p. 

The first and third terms on the right side of (53.8) give the non- 
relativistic Schrédinger equation. The second term has the form of the 
classical relativistic mass correction, which can be obtained by expanding 
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the square root of (51.2): 

2 4 
' mee 2 = (22 2n4)t _. oy) agen es 
E'! = E — mc (c?p? + m’c*) mc Sa (Bintan 
The last term is the spin-orbit energy (47.14), which is now seen to appear 
as an automatic consequence of the Dirac equation. The fourth term is 
a similar relativistic correction to the potential energy, which does not 
have a classical analog. Since it does not involve the angular momenta, 
it is much more difficult to demonstrate experimentally than the spin- 


orbit energy.! 


SEPARATION OF THE EQUATION 


The Dirac equation for a central field can be separated without approxi- 
mation in spherical coordinates. The procedure is more complicated 
than for either of the Schrédinger equations because of the interdepend- 
ence of orbital and spin angular momenta. We start by defining radial 
momentum and velocity operators . 


pr = ri(r+p — th) a, = ro (a-f) (53.9) 


both of which can be shown to be hermitian. We also define an operator 
k that will shortly be seen to be related to the total angular momentum: 


Ak = B(e’-L +h) (53.10) 


where L =rxp. It can be shown by direct substitution, with the help 
of (52.24), that 


arp, + thr—a,Bk = a-p 
The hamiltonian (53.1) then becomes 

H = casp, + = a,Bk + Bmc? + V (53.11) 
The following relations can be established with the help of the definitions 
(53.9) and (53.10) and the relations of Sec. 52: : 

a,k — ke, = 0 Bk — kB = 0 pk — kp, = 0 

ArDr — DPrhy = 0 


These show that & commutes with the hamiltonian (53.11) and so is a 


constant of the motion. The eigenvalues of k can be inferred by squaring 
(53.10): . 


h?k? = (d’- L)? + 2h(e’- L) + h? = (L + gho’)? + th? (53.12) 
The quantity (L + $d’)? is the square of the total angular momentum 


1¥or further discussion of this term, see E. U. Condon and G. H. Shortley, ‘‘The 
Theory of Atomic Spectra,” p. 130 (Cambridge, London, 1935). 
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and has eigenvalues j(j + 1)h?, where j is half a positive odd integer. 
Thus k? has eigenvalues (7 + 4)?, so that k can be +1, +2,... 

We now choose a representation in which H and k are diagonal and 
represented by the numbers F and k, respectively. a, and 8 can then be 
represented by any hermitian matrices that satisfy the relations 


a? = § = 1 a,B + Ba, = 0 


which are not difficult to verify. Such matrices can have two rows and 
columns; for example, we can put 


1 0 0 -7 
ee ee ed fay 


Now the angular and spin parts of the wave function are fixed by the 
requirement that y be an eigenfunction of the k operator (53.10). For 
such purposes as the computation of energy levels, we need be concerned 
only with the radial part; because of the structure of (53.13), this has 
two components, which we write 


r—F (r) | 
erg (53.14) 


Substitution of (53.13) and (53.14) into the wave equation with the 
hamiltonian (53.11) gives us the radial equations for an electron moving 
in a central, field. We make use of the relation 


af 0 1 
Dr = —th (3.+2) 


to obtain 
(E ~— mc? — VF + he & 4 MEG =0 
GF | kek (53.15) 
(E+ me? — V)G —tte — 4+ "“F =0 
dr r 
It is convenient to make the numerical substitutions 
_me?+E _me?—E aie 
eg Sa ee 
2n4 _ 2\3 
a= +(aia2)? = we (53.16) 
Cc 
in terms of which Eqs. (53.15) become 
a 4 k jammer (ace @ LAT 0 
dp p a thee 
(53.17) 


Prine 
Sa 
| 
DIS 
ee” 
| 
wren, 
2/8 
| 

S| 
g[N 
{ 
© 
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THE HYDROGEN ATOM 


We now find the energy eigenvalues of (53.17) when V(r) = ~—Ze?/r; 
with y = Ze?/he, the quantity V/hca becomes —vy/p. We follow a 
procedure that is analogous to that of Sec. 16 and put a 


F(p) = f(p)e* G(p) = g(p)e-" (83.18) 


The equations for f and g are 


kg a2 Y 
Pies ng for =0 
gg ( x) 9 


kf (53.19) 
ay Y : 
ro gi WM Fl a4 =0 
i ; (2 + ~) g 
We look for solutions of (53.19) in the form of power series: 
= pt(a ta ee ao = O 
f= p*(ao + aip + ) 0 (53.20) 
g=p(bo+bip+-:--) bo ¥ 0 


Since (53.14) is supposed to be finite at r = 0, we expect that s is greater 
than or equal to 1. However, in analogy with the solution of the Schréd- 
inger relativistic equation (51.15) for the coulomb field, we shall admit a 
value of s slightly less than 1. 

We substitute (53.20) into (53.19) and equate the coefficients of 
p*t’—! to zero: 


(s + v + k)b, — b1 + ya, — Sa, = 0 


| (53.21) 
(s + » — k)a, — a,_1 — yb, — S Bpat =0 
for y > 0. When v = 0, the equations analogous to (53.21) are 
8 + k)by + yao = 0 
( )bo Yao (53.22) 


(s — k)ao — ybo = 0 


Equations (53.22) have the required nonvanishing solution for ao and 
bo only if the determinant of their coefficients vanishes; this gives 


s = +(k? — y2)# (53.23) 


Because of the boundary condition at the origin, we take the upper sign 
for s in (53.23). 

A relation between a, and 6, can be obtained by multiplying the first 
of Eqs. (53.21) by a, the second by a2, and subtracting: 


b,[a(s + »v +k) + acy] = a,[ae(s + »v — k) — ay] (53.24) 
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where use has been made of (53.16). We can now examine the behavior 
of the solution at large vr. Unless both the series (53.20) terminate, this 
behavior is determined by their high terms, and so we can neglect constant 
factors in comparison with »v. We then obtain from (53.21) and (53.24) 

a, ~ — a,_-1 b, ~ zs by-1 

v v 

This means that both series have the asymptotic form e?*, and regular 
solutions are obtained only if they terminate. Suppose that this occurs 
at v = ’, so that @a41 = bay1 = 0. Then both Eqs. (53.21) give the 
relation 

A2An = —aby: m= OO Ny Bye es (53.25) 

We obtain energy levels by setting » = n’ in (53.24) and making use 
of (53.25). With the help of (53.16), we find that 

, 2H 
2a(s +n’) = y(a1 — ar) = = 


which incidentally shows that E > 0. The square of this is 
(m?ct — E*)(s + n’)? = Ey? 


which is easily solved to give 
E 2) q Y 3 53.26 
= MC + | ( r ) 


Equations (53.23) and (53.26) are equivalent to the formula first derived 
by Sommerfeld! on the basis of the old quantum theory. This formula 
accounts quite well for the spectrum of hydrogen.? The fine structure 
is made evident by expanding (53.26) in powers of y?. The result to 
terms of order y* resembles (51.21) but is not quite the same: 
2 4 
Lil Pena ee Oe 8 
E=me E an? x(F :) | (53.27) 
where n = n’ + |k| is the total quantum number of Eq. (16.14), and 
|k| can take on positive integer values. The total spread in energy of 
the fine-structure levels for a given n is easily seen from (53.27) to be 
me*yin — 1 
ni 2n 


This is substantially less than the value (51.22) obtained from the 
Schrédinger relativistic equation and is in agreement with experiment. 


1A. Sommerfeld, Ann. Physik. 61, 1 (1916). 


2 There are, however, small but important deviations from this formula; see W. E. 
Lamb, Rept. on Progr. Phys. 14, 19 (1951). 
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CLASSIFICATION OF ENERGY LEVELS 


For n’ > 0, all positive and negative integer values of k are permissible 
[we saw from (53.12) that & cannot be zero]. For n’ = 0, however, a 
contradiction can arise between (53.22) and (53.25); these give 


bo _ ae. 
ao s+k and ao a 


(53.28) 


respectively. Since s < |k|, the first of these expressions is positive or 
negative according as k is negative or positive, whereas the second is 
always negative. Thus, for n’ = 0, k can assume only positive integer 
values. 

Thus far we have shown only that the j value of a level is equal to 
|k| — 4. In order to connect J with the level, we must make the non- 
relativistic approximation that the orbital angular momentum is well 
defined. Since in this case F in (53.14) is much larger than G, we 
can replace 8 by —1 and d’ by ¢ in (53.10). In this approximation, 
(L + 4he)? = (11 + 1) + Sh? + f6-L and is also equal to 7(j + 1)h?. 
We obtain in this way 


k= sG+y-u+nee= | T) 


jzl 
—l jzl—j 
As an example of the energy levels in hydrogen, we consider the case 
n = 3. The radial quantum number n’ can be 0, 1, or 2, and k can be 


+(3 — n’) except that k can be only +3 when n’ = 0. The levels 
with their nonrelativistic classifications are 


n' k l 

0 3 2 $ 2D; 
1 -2 2 2 2D, 
1 2 1 3 *Py 
2 —1 1 + 2Py 
2 1 o 2 8 


According to (53.23) and (53.26), states with the same |k| or 7 have the 
same energy; Eq. (53.27) shows that the energy increases with increasing 
[kl. 


NEGATIVE ENERGY STATES 


We have seen that both the Schrédinger and Dirac relativistic equations 
admit of solutions for which a particle has negative kinetic energy and 
negative rest mass. These solutions correspond to the negative square 
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e 


root of the classical energy equation (51.2). The negative-energy solu- 
tions cannot be ignored in the quantum theory, as they are in the classical 
theory, since there is nothing to prevent a charged particle from making 
a radiative transition from a state of positive energy to a state of negative 
energy. 

Dirac proposed that we regard the negative energy states of Eq. 
(52.22) as being full, in which case the exclusion principle prevents transi- 
tions into such occupied states. The normal state of the vacuum then 
consists of an infinite density of negative-energy electrons. It is assumed 
that there are no electromagnetic or gravitational effects of these elec- 
trons but that deviations from the norm produced by emptying one or 
more of the negative energy states can be observed. The absence of a 
negatively charged electron that has negative mass and kinetic energy 
would then be expected to manifest itself as a positively charged particle 
that has an equal positive mass and kinetic energy. In this way, a 
“hole” theory of positrons can be formulated. 

With so many electrons present, however, the theory is no longer 
the one-particle theory contemplated when the wave equation was set 
up. A many-particle theory can be based on the Dirac equation in 
accordance with the formalism of quantized fields discussed in the next 
chapter, and a theory of positrons can be developed without recourse 
to holes. 

It might at first be thought that a similar technique cannot be 
applied to the Schrédinger relativistic equation, since it describes a 
particle of zero spin, which we expect to obey Einstein-Bose statistics 
rather than the exclusion principle. However, Pauli and Weisskopf! 
showed that the quantized field energy is always positive in this case, 
even though the parameter E in the wave equation can be either posi- 
tive or negative. Moreover, the charge in the quantized field can have 
either sign, corresponding to the ambiguity of the sign of P noted after 
Eq. (51.8). Thus both the theories discussed in this chapter predict 
the existence of particles that have positive energies and both signs of 
electric charge. The appearance of spin angular momentum as a con- 
sequence of the Dirac equation shows that this is the theory that describes 
electrons. 


PROBLEMS 


1. Use the nonrelativistic approximation implied in (51.11) and in the immediately 
following discussion to show that the expression (51.8) for P reduces to (7.1) in the 
proper limit. 

2. Solve the Schrédinger relativistic equation for an attractive square well potential 
of depth V, and radius a, after determining the continuity conditions at r = a. 


1 W. Pauli and V. Weisskopf, Helv. Phys. Acta 7, 709 (1934). 
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Obtain an explicit expression for the minimum Vp» with given a that just binds a 
particle of mass m. 

3. Find the solution of the Schrédinger relativistic equation that is finite at r = 0 
and corresponds to the potential energy e¢ = —Ze?/a for r <a and eg = —Ze?/r 
for r > a, when a is very small. Note that only the first two terms of each power 
series in r need be retained. Show that the solution for r > a approaches that which 
corresponds to the upper sign in (51.19) when a — 0. 

4. Show that any matrix with two rows and columns can be expressed as a linear 
combination of ¢,, oy, ¢:,and 1. Use this result to show that there is no matrix that 
anticommutes with each of the first three of these. 

5. Show explicitly that the wave functions (52.17) and (52.18) are not eigenfunctions 
of any component of the spin angular momentum $/d’. 

6. Show that the current density given by (52.20) for a free-particle wave function 
agrees with the corresponding nonrelativistic expression in the proper limit. 


7. Make use of Eqs. (52.11), (52.13), and (52.19) to verify Eq. (52.24). 


8. Prove that the operators a, and k defined by Eqs. (53.9) and (53.10) commute 
with each other and that h?k? is given by the right side of (53.12). 


9. Discuss the connection between the a+ £ term in Eq. (52.27) and the spin-orbit 
energy. 


10. Show that the negative square roots that could appear in arriving at Eqs. (51.16) 
and (53.26) actually do not correspond to bound states. 


11. Use the selection rules Al = +1, Aj = 0, +1 to list the frequencies of the 
allowed transitions between the states with n = 2 and n = 3 for the coulomb field, 
in both the Schrédinger and Dirac relativistic theories. In particular, show that the 
latter theory gives seven lines, of which five are distinct, whereas the former gives 
three lines that are much more spread apart. 


12. Solve the Dirac equation for an attractive square well potential of depth Vo and 
radius a, after determining the continuity conditions at r = a. Obtain an explicit 
expression for the minimum V, with given a that just binds a particle of mass m, 
and compare with the answer to Prob. 2. 


14. 
The Quantization of Wave Fields 


The theory of quantum mechanics presented thus far in this book has 
dealt with systems that, in the classical limit, consist of material particles. 
We wish now to extend the theory so that it can be applied to the electro- 
magnetic field and thus provide a consistent basis for the quantum theory 
of radiation. The quantization of a wave field imparts to it some particle 
properties; in the case of the electromagnetic field, a theory of light quanta 
(photons) results. The field quantization technique can also be applied 
to a y field, such as that described by the nonrelativistic Schrédinger 
equation (6.16) or by one of the relativistic equations (51.4) or (52.3). 
As we shall see in the nonrelativistic case (Sec. 55), it then converts a 
one-particle theory into a many-particle theory, in a manner equivalent 
to the transition from Eq. (6.16) to (16.1) or (40.7). Because of this 
equivalence, it might seem that the quantization of y fields merely pro- 
vides another formal approach to the many-particle problem. However, 
the new formalism can deal as well with processes that involve the creation 
or destruction of material particles (radioactive beta decay, meson- 
nucleon interaction). 
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This chapter is intended to serve as an introduction to quantum 
field theory.! We start in Sec. 54 with a discussion of the classical and 
quantum equations of motion for a wave field, without specifying the 
detailed nature of the field. The application to Eq. (6.16) is used as a 
first example in Sec. 55. Several other particle wave equations (including 
the relativistic Schrédinger and Dirac equations) have also been quantized 
but are not discussed here. The electromagnetic field is considered in the 
last two sections. 


54CICLASSICAL AND QUANTUM FIELD EQUATIONS 


A general procedure for the quantization of the equations of motion of 
a classical system was obtained in Sec. 24. We start with the lagrangian 
function for the system and verify that it gives the correct classical 
equations. The momenta canonically conjugate to the coordinates of 
the system are found from the lagrangian, and a hamiltonian function 
is set up. The classical hamiltonian equations of motion are then con- 
verted into quantum equations by the substitution of commutator 
brackets for Poisson brackets; this gives the change of the dynamical 
variables with time in the Heisenberg picture. We now show how this 
procedure can be applied in its entirety to a wave field (r,t), which we 
assume for the present to be real.” 


COORDINATES OF THE FIELD 


A wave field is specified by its amplitudes at all points of space and the 
dependence of these amplitudes on the time, in much the same way as a 
system of particles is specified by the positional coordinates g; and their 
dependence on the time. The field evidently has an infinite number of 
degrees of freedom and is analogous to a system that consists of an 
infinite number of particles. It is natural, then, to use the amplitudes 
y(r,t) at all points r as coordinates, in analogy with the particle coordi- 
nates q,(t) of Sec. 24. 

It is not necessary, however, to proceed in this way. As an alter- 
native, we can expand y in some complete orthonormal set of functions 


1 For further discussion, see P. A. M. Dirac, ‘“‘The Principles of Quantum Mechanics,” 
4th ed., chaps. X, XII (Oxford, New York, 1958); H. Goldstein, ‘‘Classical Mechan- 
ics,’ chap. 11 (Addison-Wesley, Reading, Mass., 1950); J. D. Bjorken and 8. D. Drell, 
“Relativistic Quantum Fields” (McGraw-Hill, New York, 1965); E. Henley and W. 
Thirring, ‘Elementary Quantum Field Theory’? (McGraw-Hill, New York, 1962); 
S. S. Schweber, ‘“‘An Introduction to Relativistic Quantum Field Theory” (Harper & 
Row, New York, 1961); J. J. Sakurai, “Advanced Quantum Mechanics” (Addison- 
Wesley, Reading, Mass., 1967). 


2 W. Heisenberg and W. Pauli, Z. Physik 66, 1 (1929); 59, 168 (1930). 
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Uk: 


/  Wr,t) = Spax(t)ux(r) (54.1) 


The expansion coefficients a, in (54.1) can be regarded as the field coordi- 
nates, and the field equations can be expressed in terms of either y or the 
a,. We shall use the wave amplitudes at all points as the field coordi- 
nates in this section. It will be convenient for some of the later work to 
make use of the coefficients a,. 


TIME DERIVATIVES 


It is important to have clearly in mind the meaning of time derivatives 
in classical and quantum field theories. In classical particle theory, both 
total and partial time derivatives were defined in connection with a 
function F(q.,p:,t) of the coordinates, momenta, and time; these deriva- 
tives are related through Eq. (24.22). Similarly, both derivatives were 
defined for a Heisenberg-picture operator and related to each other as 
in Eq. (24.10). In classical field theory, y(r) is the analog of g,, and 
the only time derivative that can be defined is dy/dt; we refer to it as 
¥ in analogy with q; in the particle case. Thus, in the classical hamil- 
tonian equations of motion of the field [(54.19) below], we interpret y, 
and also 7, as partial time derivatives. However a functional F(y,7,2) 
can depend explicitly on the time as well as on the field, so that it is 
important to distinguish between dF'/dié and dF /dt in Eq. (54.20). 

The same situation appears in quantum field theory. No distine- 
tion can be made between dy/dt and dy/dt, and both are referred to as y. 
On the other hand, a Heisenberg-picture operator can depend explicitly 
on the time, and the distinction between the two time derivatives must 
be made in Eq. (54.23). 


CLASSICAL LAGRANGIAN EQUATION 


The lagrangian L(q;,q:,t) used in Sec. 24 is a function of the time and a 
functional of the possible paths q,(¢) of the system. The actual paths 
are derived from the variational principle (24.17): 


By analogy, we expect the field lagrangian to be a functional of the 
field amplitude y(r,t). It can usually be expressed as the integral over 
all space of a lagrangian density L: 


L = fLp,Vy¥,t) dr (54.2) 


where, as remarked above, y is dy/dt or dy/dt. The appearance of Vy 
as an argument of L is a consequence of the continuous dependence of y 
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on r (continuously infinite number of degrees of freedom); higher deriva- 
tives of y could also be present but do not seem to arise in problems of 
physical interest. The variational principle that corresponds to (24.17) 
is 
te t fe 

af" Lat = 6f" [ Lata = f" [ (L) dear = 0 (54.3) 
where the infinitesimal variation 6y of y is subject to the restrictions 

bY(t,t1) = dY(r,t2) = 0 (54.4) 


If Z has the form indicated in (54.2), its variation can be written 


WN: OB 
5L = + D aayran (ze yy) + Sr a (54.5) 


where the summation over 2, y, z implies the sum of three terms with y 
and z substituted for x. Now dy is the difference between the original 
and varied ¥ and hence is the time derivative of the variation of y. 
This and the similar expression for 6(d¥/dx) can be written 


w= Sy) 8 (34) - 2 x (6H) 


Equation (54.3) then becomes 


i. NF + D sagan) a OY) + Sp a5 OP | atarr =o (54.6) 


The summation terms in (54.6) can be integrated by parts with 
respect to the space coordinates; the surface terms vanish, either because 
y falls off rapidly enough at infinite distance, or because y obeys periodic 
boundary conditions at the walls of a large but finite box. The last term 
of (54.6) can be integrated by parts with respect to the time, and the 
boundary terms vanish because of (54.4). Equation (54.6) can therefore 
be written 


{ef (3 - Lela | 2 3 (2) bydid’* =0 (54.7) 


Since (54.3) is valid for an arbitrary variation 6y at each point in space, 
Eq. ee is equivalent to the differential equation 


i =e Ox Fez iz 5 (22) =0 (54.8) 


Equation (54.8) is the classical field equation derived from the lagrangian 
density L(y, Vy,y¥,t). 
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FUNCTIONAL DERIVATIVE 


In order to pursue further ‘the analogy with particle mechanics, it is 
desirable to rewrite Eq. (54.8) in terms of LZ rather than L. Since the 
aggregate of values of y and y¥ at all points is analogous to the q; and 4; 
of particle theory, we require derivatives of L with respect to y and ¥ 
at particular points. These are called functional derivatives and are 
denoted by @L/ay and aL /ay. Expressions for them can be obtained 
by dividing up all space into small cells and replacing volume integrals 
by summations over these cells. The average values of quantities such 
as y, Vy, and y in the 7th cell are denoted by subscripts 7, and the volume 
of that cell by 67;. Then 


D, Liss (WW)ss vast Br 


approaches L in the limit in which all the 67; approach zero. 
In similar fashion, the ¢ integrand in Eq. (54.6) or (54.7) can be 
replaced by 


oL %) aL oL 
2 (3 doe Eezoilh by; 675 + a (=) dy; 67; 


where the variation in L is now produced by independent variations in 
the y,; and the ¥,. Suppose now that all the dy; and 6y; are zero except 
for a particular dy,;. It is natural to relate the functional derivative of 
L with respect to y for a point in the jth cell to the ratio of 5L to dy,; 
we therefore define 


aL. sb AL | aL | (54.9) 


BV” ano BY; 1; OP LL dx | (GY 0z) 
Similarly, the functional derivative of L with respect to ¥ is defined by 
setting all the dy; and 6¥,; equal to zero except for a particular dy;: 


aL ; 6L OL 
= lim =— 
ay = r:0 bf; br; 


(54.10) 


Here again the point r at which the functional derivative is evaluated is 
in the jth cell. Substitution of (54.9) and (54.10) into (54.8) gives 


99L _ al _ 4 _ (54.11) 


which closely resembles the lagrangian equations (24.18) for a system of 
particles. 
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CLASSICAL HAMILTONIAN EQUATIONS 


The momentum canonically conjugate to y; can be defined as in particle 
mechanics to be the ratio of 6Z to the infinitesimal change 6¥; when all 
the other 6y; and all the 6y; are zero. We thus obtain 


8L aL 
P; = — = 65; (=) (54.12) 


It follows from (54.11) and (54.12) that 


aL 
P; = 3 ins(22) (54.13) 


The analogy with Eq. (24.19) then gives for the hamiltonian 


H = ) Pas Le 1G 7) vs drs = L (64.14) 


We write H as the volume integral of a hamiltonian density H and 
assume that the cells are small enough so that the difference between a 
volume integral and the corresponding cell summation can be ignored; 
we then have 


_ ; oy — 2b ob i 
H=[Hdt H=n-L aa hay (54.15) 
The approximate hamiltonian (54.14), with the relations (54.12) and 

(54.13), can be manipulated in precisely the same way as the hamiltonian 
for a system of particles. Instead of showing this explicitly, we now work 
with the true field hamiltonian H given in (54.15), which is a functional 
of ¥ and x from which y has been eliminated. The classical hamiltonian 
equations of motion will be derived without further recourse to the cell 
approximation. The variation of ZL produced by variations of y and ¥ 
can be written, with the help of (54.11) and (54.15), 


éL = f (= by + = a) dr = / (roy + wdp) d?r 


S(drp) + zip — Por] d*r 
8H + 6L + (rsp — por) d’r (54.16) 


N 


The variation of H produced by the corresponding variations of ¥ and r 


can be written 


bH = i (2 by +§ si =) dr vote 
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It follows from the earlier discussion of functional derivatives that 


aH _ dH yo aH 
ay =s- ay £ ox 0(dy/dz) 
aH _ all 1 an (54.18) 


Ox on~—SdL, 0 O(O/OZ) 
ryz 


Comparison of Eqs. (54.16) and (54.17) for arbitrary variations éy and 
dx then gives the classical field equations in hamiltonian form: 


= — t= -— (54.19) 


The hamiltonian equation for the time rate of change of a functional 
_F of ¥ and 7 can now be found. We express F as the volume integral of 
the corresponding functional density F(y,7,t), which for simplicity is 
assumed not to depend explicitly on the time or on the gradients of y 
or. The foregoing analysis can be used to show that 


“dF _ oF oF OF . 
aa at f Get eter 


aF aF aH aF aly ,, 
a+ {ee 5 a 


= 2+ (FH) (54.20) 


This equation also serves to define the Poisson bracket expression for two 
functionals of the field variables. The right side of Eq. (54.20) is not 
changed if F also depends on Vy or Vm (see Prob. 2). It is apparent 
from (54.20) that H is a.constant of the motion if it does not depend 
explicitly on the time; in this case, H is the total energy of the-field. 


QUANTUM EQUATIONS FOR THE FIELD 


The analogy between particle coordinates and momenta q;, p; and the cell 
averages y;, P; suggests that we choose as quantum conditions for the 


field 
[viv] = [PP] = 0 [yi,P3] = thd; - (54.21) 


This means that we have converted the wave field from a real numerical 
function to a hermitian operator in the Heisenberg picture. 

We now assume that the cell volumes are very small. Then Eqs. 
(54.21) can be rewritten with the help of (54.12) and (54.15) in terms of 
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y and r: 


[y(r,t),w(r’,2)] = [r(1,t),(x’,t)] = 0 
[W(r,t),r(r',é)] = thd,2’) 


where 4(r,r’) = 1/67; if r and r’ are in the same cell and zero otherwise. 
The function 6(r,r’) has the property that [f(r)6(r,r’) dr is equal to the 
average value of f for the cell in which r’ is situated. Thus, in the limit 
in which the cell volumes approach zero, 4(r,r’) can be replaced by the 
three-dimensional Dirac 6 function 63(r — r’). The quantum conditions 
for the canonical field variables then become 


[W(z,t) W(r’,t)] Te [w(r,t),w(r’,t)] = 0. 


j (54.22) 
[y(r,) ,w(r’,t)] = thé* (r — r’) 


The equation of motion for any quantum dynamical variable F is 
obtained from Eq. (24.10) or by replacing the Poisson bracket in Eq. 
(54.20) by the commutator bracket divided by 7h. 

dF oF , 1 | | 


er a 7 ([F,H] (54.23) 
The commutator bracket can be evaluated with the help of (54.22) when 
explicit expressions for F and H in terms of y and z are given. Thus Eqs. . 
(54.22) and (54.23) completely describe the behavior of the quantized 
field that is specified by the hamiltonian H. 


FIELDS WITH MORE THAN ONE COMPONENT 


Thus far in this section we have dealt with fields that can be described 
by a single real amplitude. If the field has more than one component 
vi, ¥2, ... , the lagrangian density has the form L(y, Aya, v1, Wo, Avo, 
v2, ... ,t). Then if each of the field components is varied independ- 
ently, the variational equation (54.3) leads to an equation of the form 
(54.8) or (54.11) for each of xi, yo, .. .. A momentum canonically 
conjugate to each y, can be defined as in Eq. (54.15) to be 7, = OL/dy,. 
The hamiltonian density has the form 


H=)r¥,—L . (54.24) 


and the hamiltonian equations consist of a pair like (54.19) for each s. 
Equation (54.23) is unchanged, and the commutation relations (54.22) 
are replaced by 


[We(r,t) ,Wer(r’,t)] = [re(r,t),75-(r’,t)] = 0 


, (54.25) 
[Y.(1,t),7e"(r’,t)] = hd. 53 (r ™ r’) 
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COMPLEX FIELD 


Thus far we have dealt with fields that are real numerical functions in 
the classical case and hermitian operators in the Heisenberg picture in 
the quantum case. A different situation that is of immediate interest for 
the nonrelativistic Schrédinger equation is a single y field that is complex 
or nonhermitian. 

In the classical case we can express y in terms of real fields yi and 
Yo as 


= 24(y1 + tr) y* = 241 — tr) (54.26) 


We show first that the lagrangian equations of the form (54.8) obtained 
by independent variation of y and y* are equivalent to those obtained by 
variation of ¥; and yo. It follows from (54.26) that 


rs] 0 . oO fe] 
—_ = —~} —_—— —— = 9-4 
ayo (F - és) ye (a ie a) 


Thus the y, y* equations are obtained by adding and subtracting the yn, 
2 equations. 

In similar fashion, the classical momenta canonically conjugate to 
y and y* are seen to be 


nw = 27, — ime) ® = 25 (r1 + im) (54.27) 


The second momentum is written as 7 rather than 7* in order to emphasize 
the fact that it is defined as being canonically conjugate to y* and is not 
necessarily the complex conjugate of 7. Indeed, as we shall see in the 
next section, « is identically zero for the nonrelativistic Schrédinger 
equation. However, whenever the lagrangian is real, 7; and 72 are inde- 
pendent of each other and # = r*. In this case mi + toe = rH + 1*P*, 
and the hamiltonian is unchanged. 

The corresponding quantum case is obtained from the commutation 
relations (54.25) with s = 1, 2. If 7: and 72 are independent, then all 
pairs of variables except the following commute: 


[Yir,t),w(r’,e)] = [Y*(r,2),at@',2)] = hdr — 7’) (54.28) 


SSLIQUANTIZATION OF THE NONRECATIVISTIC 
SCHRODINGER EQUATION 


As a first example of the application of the field-quantization technique 
developed in the preceding section, we consider here the quantization of 
the nonrelativistic Schrédinger equation (6.16). The.application implies 
that we are treating Eq. (6.16) as though it were a classical equation that 
describes the motion of some kind of material fluid. As we shall see, the 
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resulting quantized field theory is equivalent to a many-particle Schroé- 
dinger equation, like (16.1) or (40.7). For this reason, field quantization 
is often called second quantization; this term implies that the transition . 
from classical particle mechanics to: Eq. (6.16) constitutes the first 
quantization. 


CLASSICAL LAGRANGIAN AND HAMILTONIAN EQUATIONS 
The lagrangian density may be taken to be 


h2 
L = thy*) — on Vy*- vy — Vir,ty*y (55.1) 
As shown at the end of the preceding section, y and y* can be varied 


separately to obtain the lagrangian equations of motion. The equation 
of the form (54.8) that results from variation of y is 


2 
—ithy* = — - V*y* + Vir,t)y* 


which is the complex conjugate of Eq. (6.16). Variation of y* gives 
Eq. (6.16): 


: h? 
thy = ——VY+ Vir,ty (55.2) 
2m 
The momentum canonically conjugate to y is 
oL 
r= 5 = ity’ (55.3) 


However ¥* does not appear in the lagrangian density, so that 7 is 
identically zero. It is therefore impossible to satisfy the second of the 
commutation relations (54.28) (or the corresponding classical Poisson- 
bracket relation), so that y*, + cannot be regarded as a pair of canonically 
conjugate variables. They can easily be eliminated from the hamiltonian 
since 7* never appears and Eq. (55.3) gives y* in terms of z.! 

The hamiltonian density is 


Haw —L = — yn.vy— vay (55.4) 
2m h 


1 The conclusion that x can be identified with y* is related to the appearance of only 
the first-order time derivative in the wave equation (55.2), since in this case ¥ can be 
expressed in terms of y and its space derivatives through the wave equation. If the 
wave equation is of second order in the time derivative, y and y are independent; 
then 7 is related to ¥* rather than to y*, and both y, x and y*, # are pairs of canonical 
variables. The nonrelativistic Schrédinger equation and the Dirac equation are of 
the former type, whereas the relativistic Schrédinger equation is of the latter type. 
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The hamiltonian equations of motion obtained from (54.19), with the 
help of (54.18), are 


= —— — V2 
b= — Wt s Wy 


‘= “ "— oh V'-r 
The first of these equations is the same as (55.2), and the second equation, 
together with (55.3), is the complex conjugate of (55.2). We have thus 
shown, from the point of view of classical field theory, that the lagrangian 
density (55.1) and the canonical variables and hamiltonian derived from 
it are in agreement with the wave equation (6.16) or (55.2). 


QUANTUM EQUATIONS 


The quantum equations are obtained by adopting the volume integral of 
(55.4) as the hamiltonian, (54.23) as the equation of motion, and the 
first of Eqs. (54.28) as the quantum condition on the wave field. Since 
y is now a Heisenberg-picture operator rather than a numerical function, 
y* is replaced by yt, which is the hermitian adjoint of y rather than its 
complex conjugate. Further, as remarked above, the Heisenberg-picture 
operators y, yf have no explicit dependence on the time, so that their 
equations of motion are given by (54.23) or (24.10), with the first term 
on the right side omitted and dy/dt on the left side identified with y. 
The hamiltonian is conveniently written with replacement of 7 by thyt 
and becomes 


H = / (= vy eve + vv) d*r (55.5) 


Application of Eqs. (22.15) and (22.16) then shows that H is hermitian. 
The quantized hamiltonian given in (55.5) is the operator that represents 
the total energy of the field; it is not to be confused with the operator 
(23.2), which is the energy operator for a single particle that is described 
by the wave equation (6.16). We have as yet given no explicit repre- 
sentation for the new operators y and H and therefore cannot say on 
what they might operate. “The choice of a particular representation is 
not necessary so far as the Heisenberg equations of motion are concerned 
but is desirable for the physical interpretation of the formalism that we 
give later in this section. 
. The commutation relations are 


vr) ve] = VI@)v'@)] = 0 


(55.6) 
v(x), ¥'(@")] = 8 — r’) 
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The omission of ¢ from the argument of the field variables implies that 
both fields in a commutator bracket refer to the same time. In accord- 
ance with the earlier discussion, the equation of motion for y is 


ih) = [yt] | 
= E i a vip vy a*'| +[¥ fvurvar] (65.7) 


_.Where primes indicate that an integration variable r’ has been substituted 
for r. The second term on the right side is easily evaluated with the 
help of (55.6) to give 


SV! — wy) dr” = JV" — VIN) dr’ 
= fV’y’ (ir — r’) d'r’ = Vy (55.8) 


y commutes with V, which is a numerical function. Evaluation of the 
first term on the right side of (55.7) is simplified by performing a partial 
integration on [Vyt’- Vy’ dr’ to obtain — fy'’V’2y’ dr’; the surface terms 
vanish because y either vanishes at infinity or obeys periodic boundary 
conditions. We thus obtain 


Sve" Vy dr’) = —ly furry’ dir'] 
= — fly Vy dir’ = —f (Wy) 8@ — 1) dr’ = —VY (55.9) 


Substitution of (55.8) and (55.9) into (55.7) yields Eq. (55.2), so that 
the equations obtained from classical and quantum field theories agree. 
A similar calculation shows that the equation zh¥~t = [yt,H] yields the 
hermitian adjoint of Eq. (55.2); it can also be seen directly that this 
equation is the hermitian adjoint of the equation zhy = [y,H] so long as 
A is hermitian. 

If V is independent of t, H has no explicit dependence on the time, 
and Eq. (54.23) shows that H is a constant of the motion. Thus the 
energy in the field is constant. Another interesting operator is 


N = f¥ty dr 


which we assume represents the number of particles in the field. We 
note first that NV is hermitian. Its time derivative is given by 


aN = [N,H] 
= | i Wty d*r, i, (Fe Vip evy’ + vy’) a’| (55.10) 


2m 
The commutator of N with the V part of H can be written as 


SIV WY — Vy) dr dir’ 
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With the help of (55.6) the parenthesis in the integrand is 
Vy — Wy = Vy + 8 — 2) — Ye 
= Wey’ y + VV — 1/) — vy 
= oily yt — er — 2b + oye — 2’) 
— yyy 
= 0 
since the 6 function vanishes unlessr = r’. A similar but slightly more 
complicated calculation shows that 


Wty, Vy vy’) = [piv’y — (Vy) y]- V8 — 4’) 
The double integral of this over rand r’ is zero. Thus Eq. (55.10) shows 
that N is a constant of the motion. 

It can also be shown that the commutator brackets in (55.6) are 


constants of the motion, so that these equations are always valid if they 
are at a particular time. 


THE N REPRESENTATION 

We now specialize to a representation in which the operator N is diagonal. 
Since N is hermitian, its eigenvalues are real. A convenient and general 
way of specifying this representation is by means of an expansion like 
(54.1) in terms of some complete orthonormal set of functions u,(r), 
which we assume for definiteness to be discrete. We put 


viet) = Laue) HL) = Yas ute) (55.11) 
k k 


where the uw, are numerical functions of the space coordinates and the a, 
are Heisenberg-picture operators that depend on the time. Equations 
(55.11) can be solved for the ax: 


a(t) = Sug (r)v(r,t) d'r ant(t) = Surry" (r,t) dr 


Thus, if we multiply the last of the commutation relations (55.6) by 
u; (r)u:(r’) on both sides and integrate over r and r’, we obtain 


[az (t), art (t)] = Sfug(r)m(r’) 68(r — 2’) d?r d3r’ = dy (55.12) 


because of the orthonormality of the u,. In similar fashion, it is apparent 
that a, and a, commute and that a,t and a; commute, for all & and 1. 
Substitution of (55.11) into the expression for N shows that 


N= Ne where Ni = ata, (55.13) 
k 


It is easily seen that each N; commutes with all the others, so that they 
can be diagonalized simultaneously. 
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CREATION, DESTRUCTION, AND NUMBER OPERATORS 


The commutation relations (55.12) for the operators a, and a,' were 
solved in Sec. 25 in connection with the harmonic oscillator. There it 
was found that the solution of Eq. (25.10), in the representation in which 
ata is diagonal, consists of the matrices (25.12). It follows that the 
states of the quantized field, in the representation in which each Me z 1S 
diagonal, are the kets 


\n1,N2, oo e Uy ees ) (55.14) 


where each n, is an eigenvalue of N; and must be a positive integer or 
zero. We also have the relations 


alm, ... Mm, ...) = mela, ...m—1,...) 
ayt|ni, . 2. my. . . ) = (me + Dim... m+1,.-..>) 


Thus a,t and a, are called creation and destruction operators for the state 
k of the field. 

The number operator N;, need not be a constant of the motion, 
although we have seen from Eq. (55.10) that N = ZN; is a constant. 
The rate of change of N;, is given by 


thN: = [axtax,H] 
where H is obtained from (55.5) and (55.11): 


2 
H = 2 a;tar i (= Vujs + Vuri+ Vupur) d*r 
= 2, aa [4 (-3 Loe es v) uy d*r (55.16) 


It is not difficult to show from (55.12) that a particular N; is constant if 
and only if all the volume integrals in (55.16) are zero for which either 
j or lis equal to k. These integrals are just the matrix elements of the 
one-particle hamiltonian (23.2), so that the necessary and sufficient condi- 
tion that N; be a constant of the motion is that all such off-diagonal 
elements that involve the state wu, be zero.! 

The case in which the ~ are eigenfunctions of (23.2) with eigen- 
values E, is of particular interest. The integrals in (55.16) are then - 
E,6;, and the field hamiltonian becomes 


H= > ayta,E, = DY NE; (55.17) 
k k : 


(55.15) 


This particular N representation is the one in which H is also diagonal; 


1 This result for the quantized field is closely related to the corresponding result, 
contained in Eq. (35.5), for the one-particle probability amplitude. 
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the ket |ni, . . . m, . . . ) has the eigenvalue 2n,L, for the total energy 
operator H. It is apparent that all the N, are constant in this case. 


CONNECTION WITH THE MANY-PARTICLE SCHRODINGER EQUATION 


The quantized field theory is closely related to the many-particle Schréd- 
inger equation discussed in Sec. 40. If the u% are eigenfunctions of the 
one-particle hamiltonian (23.2), the field theory shows that stationary 
solutions exist for which the number of particles n, in the kth state is a 
constant positive integer or zero, and the energy is 2n,#;,. Each solution 
can be described by a ket |m1, . . . mz, . . .); these kets form a complete 
orthonormal set, and there is just one solution for each set of numbers 
my, .... On the other hand, a stationary many-particle wave function 
like the y in Eq. (40.1) can be written as a product of one-particle wave 
functions u,(r)e—*#«/* if there is no interaction between the particles. 
The linear combination of such products that is symmetric with respect 
to interchange of any pair of particle coordinates can be specified uniquely 
by stating the number of particles in each state. Again, the number of 
particles in each state is a positive integer or zero, and the energy is the 
sum of all the particle energies. 

We see then that the quantized field theory developed thus far in 
this section is equivalent to the Schrédinger equation for several non- 
interacting particles, provided that only the symmetric solutions are 
retained in the latter case. We are thus led to a theory of particles that 
obey Einstein-Bose statistics. It can be shown that the two theories are 
completely equivalent even if interactions between particles are taken 
into account.! 

It is natural to see if there is some way in which the quantized-field 
formalism can be modified to yield a theory of particles that obey Fermi- 
Dirac statistics. As discussed in Sec. 40, a system of such particles can be 
described by a many-particle wave function that is antisymmetric with 
respect to interchange of any pair of particle coordinates. The required 
linear combination of products of one-particle wave functions can be 
specified uniquely by stating the number of particles in each state, pro- 
vided that each of these numbers is either 0 or 1. The desired modifica- 
tion of the theory must, therefore, limit the eigenvalues of each operator 
N, to 0 and 1. 


ANTICOMMUTATION RELATIONS 
A review of the foregoing theory shows that the conclusion that the eigen- 
values of each N;, are the positive integers and zero stems from the com- 
mutation relations (55.12) for the a, and a,'. Equations (55.12) in turn 


1See W. Heisenberg, ‘The Physical Principles of the Quantum Theory,’ App., 
sec. 11 (University of Chicago Press, Chicago, 1930). 
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arise from the commutation relations (55.6) for y and yt. Thus we must 
modify Eqs. (55.6) if we are to obtain a theory of particles that obey the 
exclusion principle. It is reasonable to require that this modification be 
made in such a way that the quantum equation of motion for y is the wave 
equation (55.2) when the hamiltonian has the form (55.5). 

It was found by Jordan and Wigner! that the desired modification 
consists in the replacement of the commutator brackets 


[A,B] = AB — BA 

in Eqs. (54.22) and (55.6) by anticommutator brackets 
[A,B], =AB+ BA 

This means that Eqs. (55.6) are replaced by 


Wr) ve], = Var’) + ¥r)Y(r) = 0 | 
Ve) ve = Vawter) + C)¥'@) =0 (55.18) 
VO), = vate’) + trv) = 8 — Pr’) 

It then follows directly from Eqs. (55.11) and (55.18) that 
[ax,a:]4 = axa, + wa, = 0 
[axt ait], = aytast + a;ta,t = ~ (55.19) 


[ax,a.t], = a,a;t + asta, = bx: 


We define N; = a,ta, as before and notice first that each N;, com- 
mutes with all the others, so that they can be diagonalized simultane- 
ously. The eigenvalues of N; can be obtained from the matrix equation 


Ni? = axtaxa,zta, = a,t(1 — azta,)a,-= ata, = Ni (55.20) 


where use has been made of Eqs. (55.19). If N;z is in diagonal form and 
has the eigenvalues n,, n,, . . . , it is apparent that N,? is also in diag- 
onal form and has the eigenvalues n,’, n,?, .... Thus the matrix 
equation (55.20) is equivalent to the algebraic equations 
npn nt = ni 

for the eigenvalues. These are quadratic equations that have two roots: 
Oand1. Thus the eigenvalues of each N; are 0 and 1, and the particles 
obey the exclusion principle. The eigenvalues of N = ZN, are the 
positive integers and zero, as before. The earlier expressions (55.16) and 
(55.17) for the hamiltonian are unchanged, and the energy eigenvalues 
are Dn, Ex. 


1P, Jordan and E. Wigner, Z. Physik 47, 631 (1928). 
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EQUATION OF MOTION 


_In order to find the quantum equation of motion for y when the hamilton- 
ian is given by (55.5), we must decide whether or not the general equation 
of motion (54.23) is to be retained. The latter equation was obtained by 
replacing the Poisson bracket by the commutator bracket in the classical 
equation (54.20). This replacement was justified by analogy with the 
particle theory of Sec. 24, by the identical algebraic properties of the two 
kinds of bracket expressions given in (24.27), and by the correspondence- 
theory argument of Prob. 9, Chap. 6. Thus abandonment of Eq. (54.23) 
means that the classical equation (54.20) is likewise abandoned. Since 
many of the quantities with which we are concerned (number of particles, 
energy, etc.) have well-defined classical analogs, we wish to retain (54.23) 
as the general quantum equation of motion. 

The equation for ¥ is then given by (55.7), where now the anti- 
commutation relations (55.18) are to be used in evaluating the right side. 
This causes (55.8) to be replaced by 


SV ry’ — vip) dr’ = SVQ + ty)’ der’ 
= fV'y’S8(r — r’) d'r’ = Vy 
Similar treatment of the first term on the right side of (55.7) causes no 
change in the right side of (55.9). Thus the wave equation (55.2) is 
unaffected by the substitution of anticommutation for commutation 
relations between the wave amplitudes. It can also be shown without 


difficulty that N and the anticommutator brackets in (55.18) are constants 
of the motion. 


PHYSICAL IMPLICATIONS OF ANTICOMMUTATION 


Since anticommutator brackets do not possess the algebraic properties of 
Poisson brackets, we can conclude that there is no classical analog for the 
quantities y and a, that satisfy the relations (55.18) and (55.19). This 
does not mean, however, that N and H do not possess classical limits, for 
they are constructed of bilinear combinations of y or a, and commute 
with each other. 

. These conclusions can be confirmed on the basis of physical consi- 
derations. In order that a field amplitude be strong enough to be 
classically measurable, it must be possible to have a very large number of 
particles in the same state so that their fields can be coherent. This 
implies that such particles obey Einstein-Bose statistics. We can, for 
example, conclude that light quanta or photons obey Einstein-Bose 
statistics, since it is. known that strong electromagnetic fields can be 
produced and are classically measurable. In the case of metallic elec- 
trons, which obey Fermi-Dirac statistics, quantities like energy, and 
charge and current density, are classically measurable since they can be 
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expressed as bilinear combinations of the field amplitude, whereas the 
electron field amplitude itself is not.? 


REPRESENTATION OF THE ANTICOMMUTING a, OPERATORS 


An explicit representation for the operators that appear in (55.19) is 
easily obtained in the hypothetical but instructive case in which the 
system has only one state. The matrix equations to be solved are 


a? = al? =0 aat+ata =1 N = ata (55.21) 


Equations (55.21) are just those solved in Prob. 4, Chap. 6. We have 
already seen in (55.20) that N? = N, so that N has the eigenvalues 0 and 
1. If there is no degeneracy, N can be represented by the diagonal matrix 


N = E | (55.22) 


It is interesting to note that a cannot be diagonalized since N has a non- 
vanishing eigenvalue. If it could be, the first of Eqs. (55.21) would show 
that the square of each of its eigenvalues is zero; this would mean that a 
and hence at and N are identically zero, so that N could not have the form 
(55.22) in any representation. 

Explicit matrices for a and at that agree with (55.21) and (55.22) are 


a= k a at = k | (55.23) 


The two kets that represent the two possible states of this system can be 
expressed as 


\0) = lo | ll) = H (55.24) 


It is easily seen that the first ket in (55.24) has the eigenvalue 0 for the 
operator N given in (55.22), and the second has the eigenvalue 1. The 
following relations are easily obtained from (55.23) and (55.24): 


a|n) = n|1l — n) at|n) = (1 — n){1 — n) (55.25) 


Thus a and at again play the roles of destruction and creation operators, 
respectively. 

In the actual problem, the number of states of the system is infinite, 
and it is not convenient to write explicit matrices like those in (55.23). 


! The classical measurability of the field amplitude for any charged particle (Einstein- 
Bose or Fermi-Dirac statistics) implies that ¥ appears linearly in H, since the energy 
must then depend on y itself as well as on bilinear combinations of y. .This in turn 
implies that terms linear in a; or a,t appear in the hamiltonian, so that single charged 
particles can be destroyed or created. Thus y cannot be measured classically if the 
theory is such that electric charge is conserved. 
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We can, however, find the effects of operating with a, and a,t on a ket 
|mi, .. . ,mx, . . .) that has the eigenvalue n, (= 0 or 1) for the operator 
N;,. The desired relations would have the form (55.25) were it not that a 
series of such equations (with subscripts added) would not agree with the 
first two of Eqs. (55.19). 

We therefore proceed in the following way. We order the states k 
of thesystem in an arbitrary but definiteway:1,2,...,k,.... Then 
operation with each a, or a," has the form (55.25), except that a multiply- 
ing plus or minus sign is introduced, according as the kth state is preceded 
in the assumed order by an even or an odd number of occupied states. 
We thus replace the Einstein-Bose equations (55.15) by the exclusion- 
principle equations 


ax|n1, see My . .) = 6:Nx|721, a tet Se yl — mM... >) 

axt|ni, «2. Mey...) = CL — m)[m, . . © 1 — m, . . .) (55.26) 
k—-1 

&, = (—1)”" "y= y Nj 
j=1 


As an example, we calculate the effect of operating with a,a; and with 
aa, on some ket, where we assume for definiteness that the order is such 
that 1 > k. If each operation is not to give a zero result, both n, and n; 
in the original ket must equal unity. Operation with a,a; empties first 
the ith and then the kth state and introduces a factor 0:4. Operation 
with aa, empties the kth state first, so that 6, is unchanged. But when 
the ith state is emptied in this case, there is one less particle in the states 
below the /th than there was in the previous case, since the kth state is 
now empty, whereas it was occupied before. Thus the sign of 6, is 
changed. We find in this way that 


Cadi ie od Ti ade Mee. Ga kN oe EM ee 


in agreement with the first of Eqs. (55.19). In similar fashion, it can be 
shown that Eqs. (55.26) agree with the result of operating with the other 
two of Eqs. (55.19) on any ket. Since the aggregate of kets represents 
all possible states of the many-particle system, they constitute a complete 
set, and Eqs. (55.19) follow as operator equations from Eqs. (55.26). 


56LJELECTROMAGNETIC FIELD IN VACUUM! 


We now apply the methods developed in Sec. 54 to the quantization of 
the electromagnetic field in vacuum. Since we are not concerned with 


1 For further discussion of the material in this section and the next, see the references 
cited in footnote 1, page 491, and also E. Fermi, Rev. Mod. Phys. 4, 87 (1932); L. 
Rosenfeld, Ann. Inst. Henri Poincaré 1, 25 (1931); W. Heitler, ‘The Quantum Theory 
of Radiation,” 3d ed. (Oxford, New York, 1954). 
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establishing the Lorentz invariance of the theory, we use three-dimen- 
sional rather than four-dimensional notation throughout.. We start by 
finding a lagrangian whose variation yields Maxwell’s equations. From 
this, canonical momenta can be defined, and a hamiltonian can be set up. 
Quantization is effected by replacing the classical Poisson brackets by 
commutator brackets. We shall not consider the possible existence of 
anticommutation relations between the field variables, since experiment 
shows that strong electromagnetic fields are classically measurable and 
that photons obey Einstein-Bose statistics. 


LAGRANGIAN EQUATIONS 


Maxwell’s equations in empty space are obtained by setting p and J equal 
to zero in Eqs. (44.2): 


10H _ 10E _ 
YX > VxH~--— = 0 (56.1) 


vV-E=0 v-H=0 


The lagrangian is most conveniently expressed in terms of the potentials 
A, ¢ that are partially defined by — 


10A 
ES ae Ve H=vVxA (56.2) 


As discussed in Sec. 44, this does not specify the potentials completely, 
since gauge transformations of the potentials can still be made without 
altering the electric and magnetic field strengths computed from (56.2). 

The lagrangian density can be taken to be 

1 /10A 2 1 
= —f-—- — fore See 2 

L 3 € aE + ve) 3, (Vv x A) (56.3) 
The lagrangian equations are obtained from (54.8) if we regard Az, Ay, 
A,, and ¢ as the field variables. Variation of the components of A gives 
three equations that can be written together as 


1 1 @f1loA 
~pvxvxa- 25 (iF + ve) =0 


This is the same as the second of Eqs. (56.1). Variation of ¢ gives 


1 10A 
- EV (CG + ve) <0 


which is the same as the third of Eqs. (56.1). The definitions (56.2) for — 
the potentials automatically satisfy the other two of Maxwell’s equations. 
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HAMILTONIAN EQUATIONS 


The momentum canonically conjugate to Az is found from (54.15) and 
(56.3) to be . 


_ 1 (180A, , 0¢ 
P, = 7 ( = + $s) (56.4) 


with similar expressions for the other two momenta. The momentum 
canonically conjugate to ¢ vanishes identically, since ¢ does not appear 
in the lagrangian density. A similar situation was encountered with y* 
in the nonrelativistic Schrédinger equation (Sec. 55); as before, it means 
that ¢ cannot be treated as a field variable and must be eliminated from 
the hamiltonian. 

The hamiltonian density is obtained from (54.24): 


H =P. _ 1, = dnctP? + + (v x A)? — P+ VS (56.5) 


where use has been made of (56.4) to replace 0A/dt by terms involving P. 
The hamiltonian equations of motion (54.19) are 

edrtP— eve 9 =~ 1 vxva (56.6) 
The first of these equations is the same as (56.4); it is necessary that (56.4) 
be obtained over again in this way, since the hamiltonian formalism con- 
sists only of (56.5) and the canonical variables A and P. We can now 
make use of this equation to define a quantity E that is equal to —4mcP. 
Then the second of Eqs. (56.6) agrees with the second of Maxwell’s equa- 
tions (56.1), if we also define a quantity H to be equal to V x A. The 
first and fourth of Eqs. (56.1) are satisfied because of the way in which 
E and H are defined. 

The third of Maxwell’s equations cannot be obtained as a hamilton- 
ian equation based on (56.5). We can, however, say that we shall be 
concerned only with those solutions of the hamiltonian equations for 
which V-E = 0, or V- P = 0, at some definite time. If then we can 
show that this restriction is maintained at all times, the solutions so 
chosen form a consistent and satisfactory set. The time derivative of 
V - P is found from the second of Eqs. (56.6) to be 
oyip = ~tyv.vxvxA=0 

ot Ar 
Since the field equations are of first order in the time derivatives, we have 
shown that the restriction that V - E = 0 at one instant of time is equiva- 
lent to the validity of the third of Eqs. (56.1) at all times. 
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We now see that the last term in the hamiltonian density (56.5) con- 
tributes nothing to the field hamiltonian. Its volume integral can be 
transformed by means of a partial integration into cf¢V + P d*r, which is 
equal to zero; the surface term vanishes because P either vanishes suffi- 
ciently rapidly at infinity or obeys periodic boundary conditions at the 
walls of a large box. The hamiltonian is then 


H = / | arctP* aa es (vx ay| d* (56.7) 


and ¢ has disappeared. This is in agreement with the usual expression 
(1/8) f(E2 + H?) dr for the total energy in the electromagnetic field. 
QUANTUM EQUATIONS 
The classical electromagnetic field is converted into a quantum field in 
the following way. We start with the hamiltonian (56.7) and the 
canonical field variables A, P. Since ¢ no longer appears, it is con- 
venient to choose the gauge so that ¢ = 0. The general equation of 
motion is (54.23), and the commutation relations (54.25) between the 
field variables become 
[A.(r,t),A.(r’,6)] = [P.@,t),P.(t’,t)] = 0 
[A.(r,t),P.(r’,t)] = thé. 5(r — 1’) 
Each of the indices s, s’ can be @, y, or z. 
The equation of motion for a typical component of A is 
ihA,(r,t) = [A.(1,t),H] 


A,commutes with the (V x A)? term in H and also with that part of the P? 
term that involves P,? + P.?; thus we need calculate only the commutator 
of A, and the P,? term in H. This is 2rc? times the integral over r’ of 
[A.,P,2], where the prime indicates that the argument is r’ rather than r. 


[A.(r,t),P.2(r',t)]) = A,P,? — PA, 

= (PLA. + thor — r’)|P, — PPA. 
PLPLA, + thd8(r — 1’)] + ch53(r — 2’) PP, — PPA 
= 2hd(r — r’)P,(r’,t) 


(56.8) 


Integration over r’ gives the relation 
thA,(r,t) = 2mc?f2ihd%(r — r’)P,(r’,t) dr’ 
= 4rc*ihP,(1,t) (56.9) 


This is the same as the corresponding classical equation, which is the first 
of Eqs. (56.6), when ¢ = 0. 
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The equation of motion for a typical component of P is 
whP.(1,t) = [P.(r,t),H] 


P, commutes with all the integrand of H except for that part which con- 
tains the sum of the squares of the y and z components of V x A. The 
calculation of this commutator bracket is straightforward but tedious 
(see Prob. 10) and yields an expression for P that is in agreement with the 
second of Eqs. (56.6). Thus if wedefine E = —4rcPand H = V x A, the 
quantum equations of motion for A and P agree with the first, second, and 
fourth of Maxwell’s equations (56.1). 

The third of Maxwell’s equations must be imposed as a supplemen- 
tary condition, as in the classical case. If we set V+ P equal to zero at a 
particular time, it is always zero since its time derivative is zero. Equa- 
tion (56.9) then shows that the time derivative of V - A is always zero, or 
that V-Aisaconstant in time. It is convenient to restrict the choice of 
gauge further so that V- A is zero everywhere at a particular time, in 
which case we see that it is zero at all space-time points. It is apparent, 
however, that the introduction of the supplementary condition is incon- 
sistent with the commutation relations (56.8). For example, the com- 
mutator bracket of A, and V- P should be zero, since V : P is zero, but 
is computed from (56.8) to be 


0 
ox’ 


[A,(1,t), V+ P(r’,t)] = th | Hr — 1’) 

It is not surprising that this inconsistency should arise, since Eqs. (56.8) 
imply that there are three independent pairs of canonical variables, 
whereas the restrictions V- P = 0 and V- A = 0 cause only two of these 
pairs to be linearly independent. We should therefore modify the com- 
mutation relations so that they are consistent with the supplementary 
condition. : 

The nature of this modification is established in Prob. 11. It turns 
out that the commutator brackets of A(r,t) and P(r’,t) do not vanish when 
r —r’ is finite. This would appear at first to contradict the physical 
principle that there can be no interference between electromagnetic field 
measurements performed at different places and the same time since this 
field has a finite speed of propagation. However, the vector potential A 
is not in itself a physical quantity; only the electric and magnetic fields 
are directly measurable. We shall now show with the help of (56.8) that 
the commutation relations of E and H have the required infinitesimal 
character and are, moreover, consistent with the supplementary condition 
v-E=0. It can also be shown (see Prob. 12) that the same results are 
obtained by starting with the modified canonical commutation relations 
of Prob. 11. 


THE QUANTIZATION OF WAVE FIELDS _ 513 
COMMUTATION RELATIONS FOR E AND H 
The electric and magnetic fields are defined by the equations 

E = —4cP H=vxA (56.10) 


where the commutation relations for A and P are assumed to have the 
form (56.8). We see at once that 


[E.(r,¢), E.-(r’,t)] => [H.(r,¢), H.-(1’,t)] = 0 (56.11) 


where each of the indices s, s’ can bez, y, or z. The commutator bracket 
for typical parallel components of E and H is 


(ted Aol Ste | Pn Ge al = (56.12) 


For typical perpendicular components of E and H, we obtain - 


aA! aA! 
rae | P= (FF =) | 


é / 
4irc aa! [P:,A{] 


[Ex(r,4), Hy(r’,t)] 


sci = — 7 8 (xr — r’) (56.13) 


Other relations similar to (56.13) are obtained by cyclic permutation of 
v, Y; @. 

It follows at once from (56.11) that V - E commutes with all com- 
ponents of E. The commutator bracket of V - E and a typical component 
of H is, with the help of :(56.13), 


Iv-E, H’] = kz Hs] a E | 


0z 
Paes ¥ ig 3 = 3 — 
= Ancit | wap oe + 5g Oe r) 
(56.14) 
Since (0/dy’)63(r’ — r) = —(0/dy)s3(r' — r), we see that the right side of 


(56.14) is zero. Thus V - E commutes with E and H and hence also with 
the hamiltonian, which from (56.7) can be written as 


se = f (E? + H?) dr | (56.15) 


This means that V-E is a constant of the motion and so is zero at all 
space-time points if it is made to vanish everywhere at a particular time. 

As would be expected, the field commutation relations (56.11) to 
(56.13), together with the hamiltonian (56.15), can be used in place of the 
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canonical formalism originally developed in terms of Aand P. We have 
already seen that V- E is a constant of the motion; a similar calculation 
shows that V - H is constant, so that it can also be made to vanish at all 
space-time points. The first two of Maxwell’s equations (56.1) then 
follow as special cases of the general equation of motion (54.23) (see 
Prob. 14): 


inks = (Ee, H) = = f(s, (Hy + He) at! 


(56.16) 
He = (He H] = = f tHe, (Ey + E22] 


PLANE WAVE REPRESENTATION 


For many applications, a representation of the potentials and fields in a 
complete orthonormal set of plane waves is useful. These plane waves 
are taken to be vector functions of r that are polarized perpendicular to 
the propagation vector so that the conditions V-A = V-P = 0 are 
satisfied. 


U(r) = L-te,, exp (ik - r) A= 1,2 


The vectors k are chosen as in (11.3), so that the u,, satisfy periodic 
boundary conditions at the walls of a large cubical box of volume L?. The 
&, are unit vectors, and 4), tk, and k form a right-handed set, so that 
k-e,, = 0 and V- uy, = 0. It is easily verified that the orthonormality 
property assumes the form 


i Us © User? r= Sy dyy! 
We expand A and P in terms of the uy): 


A(r,t) = ¥" [aen(t)usa(t) + deat Quin (2) 
kA 


Port) = Y’ (rate) + Poa’ Oun)] 


kA 


(56.17) 


The operators qt and py,t are the hermitian adjoints of gy, and pra, 
respectively; thus A and P are hermitian. The primes indicate that the 
summations extend over half the k space, so that the plane waves uy, do 
not duplicate u_y,. 

We take for the commutation relations between the q’s and p’s 


[qurn(t), Puy’ (t)] = [qa' (4), Pen (é)] = thoy, dy (56.18) 
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with all other pairs commuting, and verify that they give the correet com- 
mutation relations between A and P. It is apparent that 


[A.(x,t), Aw(r’,t)] = [P.(r,t), Pe (t’,)] = 0 
We also obtain from (56.17) and (56.18) 
[A.(r,t), Poe] = SY DY (Laen(t), Pernt eter e (tub yser(2") 


kA OK! 
+ [Qiat(t), Durr (t) iy. (1) Ura’ e(r’) } 
taAL-3 > €xn,2€kn.2’ EXP [7k + (r — r’)] (56.19) 
kd 


The subscripts s, s’ denote cartesian components of the vectors on which 
they appear; the prime has been removed from the last summation in 
(56.19) since the primed summation over terms with both k and —k is 
equivalent to a summation of k terms over the entire k space. 

If there were three mutually perpendicular unit vectors e,,, then the 
three numbers «,,. would be the direction cosines of the cartesian direc- 
tion s, and we would have that » €ic,2€kn,s' = Oss*. Since there are just 


two unit vectors een that are ‘se nahidioutes to each other and to k, we 
can write 


kk 
k2 


es = bss° — 
r» 


We also have that 


o 9 exp fk (r — v)] 


k,ky exp [tk- (r — 1’)] = an, Or’, 


With these substitutions and with the replacement of L~-? ») by (2r)-3 ( dk 
when ZL is large, we can rewrite (56.19) as 


[A.(r,t), Pa(1’,t)] = thd. {(20)-?f exp [ck + (x — r’)] d?k} 


—th ar, ar 7 [ny a exp [tk- (r — r’)] we| (56.20) 
The first brace expression in (56.20) is equal to 6(r — r’). The second 
brace can be calculated as a special case of Eq. (37.18) and is equal to 
(4x|r — r’|)-!. The commutator bracket (56.20) then becomes 


(rd) = thé..6%(r —r') ~22 9 9 f_ 1 
[A.(r,t), Pa(1’,2)] = thd,.°6 . r’) dn Ore ari, (; =F (56.21) 


which is that assumed in Prob. 11; the other commutator brackets vanish. 
This confirms the choice of the commutation relations (56.18). 
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QUANTIZED FIELD ENERGY 
Substitution of (56.17) into the field hamiltonian (56. 7) gives 


, k? 
H = 2 (4retpespn! + i mon) (56.22) 


since gx, and gy,', and py, and py, commute; here use has been made of the 
restriction on the summation to half the k space, which makes all integrals 
of the form fuy, + Uy, d’r vanish. 

We wish now to find the eigenvalues of H when the commutation 
relations are as given in (56.18). This can be done by choosing linear 
combinations of the plane wave amplitudes that make H formally equiva- 
lent to the sum of the energies of a number of harmonic oscillators (see 
Sec. 55). Now each index pair k, \ corresponds to two linearly polarized 
plane waves that travel in opposite senses along the k direction. Thus 
we want our new linear combinations of TerUlicn and pau, to have the 
general forms 


Oy exp i(k +r — ket) at exp i(k-r + ket) (56.23) 


where a, and aj, are operators that are independent of randt. The first 
of these is a plane wave that travels in the positive k direction and has 
the positive angular frequency ke, and the second is the hermitian adjoint 
of a plane wave that travels in the negative k direction and has the same 
frequency. 

With the remarks of the preceding paragraph as a guide, we proceed 
to find the time dependence of gy, and py,. Their equations of motion 
are given by (54.23): 


thd = [qe,H] = 4rthc* pen 


; hk? . | (56.24) 
thie = [po,H] = — 7 Qn 


Elimination of py, gives a second-order equation for gx): 
Gex = 400’ Pyy = — KC? 
This is easily integrated to give 
Qer(t) = aye + ayateteet | (56.25) 


where we have followed the pattern indicated by (56.23). We then 
obtain from the first of Eqs. (56.24) 


= tk —tket tk »t tket 
pur(t) = Gre mene i oe e (56.26) 
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Equations (56.25) and (56.26) can be solved for the a’s. 


Ant 
ax = ¥ (om + — po) etket 
(56.27) 


, 
Qk)! 


4rd 
3 (m oz —— . po) oe 


Similar relations hold for their hermitian adjoints. The commutation 
relations for the a’s can be obtained from (56.27) and (56.18): 


h 
[ax 04-9’) = [akn,a’by’] = om On’ dar’ (56.28) 


with all other pairs commuting; these are independent of the time, as 
they should be. 

Substitution of (56.25) and (56.26) into the hamiltonian (56.22) 
gives 


| | 
H =) x Casas? 2 alta (56.29) 


KA 


If we adopt the definitions 
Nw = eis ax Tay Nin = cum Ow aka (56 30) 
NS Dahle OES atic 


we see from the work of Sec. 55 that Ni, and Ni, each have the eigen- 
values 0, 1, 2,.... In terms of the N ’s, the hamiltonian (56.29) 
becomes 


H = Y" tck(Na + Nin t+ 1). (56.31) 
kA 


Because of the structure of (56.23) and (56.25), we can identify a, with 
@_4,, and Ny, with N_y. Then the restriction on the summation of 
(56.31) to half the k space can be removed to give 


H = ¥ fick(Nin + 4) (56.32) 
ky 


Equation (56.32) is equivalent to Planck’s quantum hypothesis: The 
energy associated with each plane electromagnetic wave is an integer 
multiple of the fundamental quantum hy = hike. In addition to the 
Planck energy, however, there is the harmonic-oscillator zero-point energy 
of one-half quantum per state of the field, which is infinite since there are 
an infinite number of states. This infinite energy is not objectionable 
since it does not interact with charged matter.: 


1 See also the discussion at the end of Sec. 57. 
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QUANTIZED FIELD MOMENTUM 


The momentum density of an electromagnetic field is the Poynting vector 
(c/4r)E(r,t) x H(r,t) divided by c?. The total momentum in the field can 
then be written in terms of the canonical variables, with the help of 
(56.10), 


| 1 
G= 7 [ExHar=— [Px (Vv xA)dr 


Substitution from (56.17), (56.25), (56.26), and (56.30) gives 


I 


, 
G=1 > k(pingiat — Dialer) 
PN 


1 , 
z=) kk[(@uxdeat + aiartazr) — (dirdiat + aiatain)] 
1 oN 


I 


>: AK[(Nia + 2) — (Nin + 2)] = Y, tN (56.33) 
EA 


ky 


where the restriction on the summation is removed as in the transition 
from (56.31) to (56.32). In this case, the zero-point terms cancel for 
plane waves that travel in opposite directions. 

Equations (56.32) and (56.33) show that the energy and momentum 
of each plane wave are quantized in units of ike for the energy and Ak for 
the momentum. It will also be shown explicitly at the end of this section 
that the quantized electromagnetic field propagates with the classical 
speed of light. 


A(r,{) IN THE PLANE WAVE REPRESENTATION 


The vector potential appears in the interaction between charged particles 
and the electromagnetic field that is discussed in the next section. We 
shall therefore require an expression for A(r,f) in the plane wave repre- 
sentation that is specified by the eigenvalues ny, of the operators Ni. A 
typical ket for this representation can be written | - - - ma °° - ), which 
describes a state of the electromagnetic field in which there are m, light 
quanta with momentum Ak and polarization e,,. We then see from Eq. 
(56.28) and the work of Sec. 55 that the operators a and a,,' have the 


properties 
Onrh t 
kr : i Nn -: -) = Ge - kn — 1-- -) 
2rh 1) }? 
at sg pm [PAE DT my ta 9 


(56.34) 
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We obtain from (56.17) and (56.25) 


AG = IY’ sal(ane + abstet) exp (ik 
+ (auatetot + ayne*"") exp (—ik - r)] 
= [-3 ») £4 {din exp [1(k- r — ket)] | 
+ ay' exp [—7(k-r — ket)]} (56.35) 


Here again the restriction has been removed from the summation by 
identifying ay, with a4. This expression for the vector potential is 
easily seen to be hermitian, as it must be. It follows from the structure 
of (56.34) that a,, and a4,' are destruction and creation operators, respec- 
tively, for a light quantum in the state k, \. Thus a term in the hamilton- 
ian linear in A would give rise to the absorption and emission of light 
quanta. 


COMMUTATION RELATIONS AT DIFFERENT TIMES 
It is interesting to generalize the commutation relations (56.11) to (56.13) 
for the components of E and H to the case in which the times are different. ! 
The results show under what circumstances measurements of the electro- 
magnetic fields at different space-time points affect each other. 
Expansions for E and H, in terms of the a,, that are analogous to 
(56.35), can be found without difficulty: 
E(r,t) = L-3 > they {Qa exp [i(k-r — ket)] 
ich 
— Gt exp [—7t(k +r — ket)]} (56.36) 
H(r,t) = L- > U(K X ey) {dy exp [7(k-r — ket)] 
3 
— Q,' exp [—7(k-r — kct)]} 


The commutator bracket for two cartesian components of the electric 
field strength is 


{Es(r,t), Ex (r’,t’)] = L-3 > A4nthckeg s€y,2' Sin (K+ @ — ker) (56.37) 
kA : 
e=r-r r=t-?t 


where use has been made of (56.28). The summation over the polariza- 
tion index \ can be evaluated by the technique used on the similar summa- 


1 These commutation relations are due to P. Jordan and W. Pauli, Z. Physik 47, 151 
(1928). 
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tion in (56.19): 


¥ easton’ sin (K+ @ — er) = Jy (Rb — kjky) sin (kk 9 — ker) 
x 


~ Re 


The summation over k can be evaluated by replacing it with an integral 
for large L: 


L-* Yk! siti (+ @ — ker) —> (2m)-* [ t'sin (e+ 9 — ker) ak 
k 
= — (45%)! oe [cos (kp — ker) — cos (kp + ker)] dk 


after performing the integration over the angle between k and 9; we call 
this expression D(o,r). Equation (11.10) can then be used to express D 
in terms of 6 functions: 


D(g,7) = (4xp)—[8(p + er) — 5(p — cr)) (56.39) 
It then follows from (56.37) and (56.38) that 
[Es(r,2); E,-(r’,t’)] 


, (iar 9 9 8 @ ; : 
= 4athe (es a at ar, art, Dir —r,t t ) (56.40) 


The commutator bracket for two cartesian components of the magnetic 
field strength can be found in the same way and is 
[H.(r,t), Hee(r’,t’)] = (Es(r,¢), Exr(r’,€’)] (56.41) 


An expression for the commutation relation between components of 
E and H can also be obtained from (56.36). 


{E.(r,é), H,-(r’,t’)] 
as » 4mthok (€x1,s€ks.0" — €k2.s€ks.2’) Sin (K+ @ — ker) (56.42) 
k 


It is apparent that (56.42) vanishes if s = s’, so that parallel components 
of the electric and magnetic field strengths commute at all space-time 
points. If s ~ s’, we can put s = z, s’ = y, where z, y, z form a right- 
handed set of axes, in which case we see that 


kz 
€i,2€k2,y — €k2,2€kiy = (ei X &kz)2 = ; 
An analysis similar to that which led from (56.37) to (56.40) then shows 
that ; 


MAL a acy 2 2 
{E-(r,t), H,(r it )] = 4xih az of 


The subscripts z, y, z can be permuted cyclically in (56.43). 


Da —r,t—?#) (56.43) 
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Since all these commutation relations involve the D function (56.39), 
we see that all components of the field strengths commute except in the 
infinitesimal neighborhood of the hypersurfaces c(é — ¢’) = +|r —1’| in 
four-dimensional space-time. These hypersurfaces constitute the light 
cone, which is the locus of all light pulses that pass through the space 
point r =r’ at the time t = ¢’. Thus the field strengths at space-time 
points so situated that a light signal cannot pass from either one to the 
other commute with each other, and both can be measured precisely. 
This shows that the quantized electromagnetic field is propagated with 
the classical speed of light, c. A discussion of the connection between 
these commutation relations and the uncertainty principle has been given 
by Bohr and Rosenfeld.! 


57LJINTERACTION BETWEEN CHARGED PARTICLES AND 
THE ELECTROMAGNETIC FIELD 


We now wish to combine the quantization of the nonrelativistic Schré- 
dinger equation (Sec. 55) with that of Maxwell’s equations (Sec. 56), to 
obtain a quantum theory of the interaction of charged particles with the 
electromagnetic field. As before, we start with a classical lagrangian, 
use it to obtain the canonical variables and the hamiltonian, and then 
carry through the quantization. We shall apply the resulting formalism 
to the calculation of the coulomb interaction between charged particles 
and also the emission and absorption of radiation. The latter will enable 
us to place the theory of Chap. 11 on a sounder basis. 


LAGRANGIAN AND HAMILTONIAN EQUATIONS 


Our lagrangian must yield the nonrelativistic Schrédinger equation (44.1) 
and its complex conjugate through variation of y* and y, respectively, 
and also the inhomogeneous Maxwell equations (44.2) through variation 
of A and ¢. The sum of the lagrangian densities (55.1) and (56.3) has 
these properties, provided that we make the substitutions 


in vine “ ee ivy —invy —L Ay (57.1) 


in the former, where e is the charge of the particle. This means that the 
electric charge and current densities that appear in (44.2) are those 
obtained by multiplying Eqs. (7.1) and (7.3) by e and making the 
substitutions (57.1): 


p = ep*y Se y*Vy — (Vy*)¥] — = ayy _ (57.2) 


1N. Bohr and L. Rosenfeld, Kgl. Danske Videnskab. Selskab, M at.-Fys. Medd. 12, 
8 (1933). 


a 
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It was shown in Prob. 2, Chap. 11, that this p and J satisfy the equation 
of continuity when y is a solution of (44.1). We thus arrive at the 
lagrangian density 


toon (oth- a) (oreo (Corin) 


: 10A 1 
— Vy vte = (a + 8) — go (V x A)? (57.3) 
It is shown in Prob. 17 that Eqs. (44.1) and (44.2) do indeed follow from 
Eqs. (57.3) and (54.8). 

Equation (57.3) suffers from the defects noted earlier in connection 
with (55.1) and (56.3). Since the time derivatives of y* and ¢ do not 
appear, their canonical momenta cannot be defined, and they must be 
eliminated as coordinate variables from the hamiltonian theory. As 
before, the momentum canonically conjugate to y is thy*, and the 
momenta canonically conjugate to the components of A are the com- 
ponents of P = (4mc)—1[(1/c)(d@A/dt) + V¢]. The hamiltonian then 
becomes 


H = [ (ime H+ e- ae ay 7 


ai (omen a (nae 
+ Vyty + 2nc?P? + = (Vv x A)? —cP- vo| d’r (57.4) 


and y* appears as a canonical momentum variable. It is not difficult 
to show that the hamiltonian equations of motion for y and its canonical 
momentum 7hy* are (44.1) and its complex conjugate, respectively. The 
hamiltonian equations for A and P are 


= = 4rc°?P — cV¢ 
aP 1 1 Fn) 


where J is given in (57.2). Thusif we define E = —4rcP andH = V x A, 
as in Sec. 56, we obtain the first, second, and fourth of Maxwell’s equations 
(44.2). 


ELIMINATION OF ¢ 


The third of Maxwell’s equations (44.2) must be obtained as a supple- 
mentary condition, as was the corresponding equation in Sec. 56. We 
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shall therefore be concerned only with those solutions of the hamiltonian 
equations for which V- E — 4rey*y = 0 at some definite time. If then 
the time derivative of this quantity is zero, the restriction is maintained 
at all times, and the solutions so chosen form a consistent and satis- 
factory set. With the help of the first of Eqs. (57.2), the second of Eqs. 
(57.5), and the definition of E, we see that 


: *y) = —4nV-J — 4n 2? 
53 (VE — Aree ¥) = —4nV-J — 40 at 


This is zero because of the equation of continuity for the electric charge 
and current densities. 

We can now see that the two terms on the right side of (57.4) that 
involve ¢ cancel each other. The second ¢ term can be integrated by 
parts to give 


—c[P-V¢d'r = c[oV- Pdr = —fop dr 


which is equal to and opposite in sign from the first @ term ef ¢y*y d?r. 
Thus ¢ has disappeared from the hamiltonian and may be chosen in any 
convenient way. The choice is made so that, when P (or E) is divided 
into solenoidal and irrotational parts, the latter part is expressed entirely 
in terms of ¢. We put 


P=P,+P, 
v-P,=0 P, is solenoidal (57.6) 
VxP.=0 P, is irrotational 


If now we put Pz = (47c)~1V¢, we see that the third of Eqs. (57.6) is 
satisfied and that the first of Eqs. (57.5) becomes 


aA 
ai 7 Axe'Ps (57.7) 


It is now possible to have V- A = 0 at all times if it is true at any one 
time, since (57.7) and the second of Eqs. (57.6) show that (0/dt)V - A = 0; 
we therefore choose the gauge so that V- A = 0. This choice is usually 
referred to as the radiation gauge or the coulomb gauge, since there is a 
clear separation between the radiation field, described by P; or A, and 
the coulomb interaction, which we shall see below is described by Pz. 
or ¢. 

The ¢ potential reappears in the hamiltonian (57.4) through the 
P? term. The volume integral of P? can be written, with the help of the 
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~ expression for P, and a partial integration, 
i, Pedi = [ Pedr + i (2P, + P.) - Pp dr 
= [Pear cae / (2P, + Ps) «Ved 
4c J * ith 
= / P,2 dr — ae J $V + (2P; + P2) dr 
4ac ; 


Now V- P, = 0, and the supplementary condition is that V-P, = —p/c; 
thus the P? term in H becomes 

Qnc?f{P2 dir = 2ec2?[P,2 dr + 3f gp dir (57.8) 
The choice of ¢ tells us that | 

V2h = 4ncV > P, = —4rp 


This equation can be integrated by making use of the free-particle Green’s 
function (37.19) with k = 0; the result is 


o(r,t) = | rae . i) ae : (57.9) 
The hamiltonian on can now be rewritten, with the help of (57.8) 
and (57.9), 


H = / es | (#¥ om <A) "| 2 | (av = <A) | as vy | dr 
+f [2roPe + 3 +x ay| dr 
apet i oe o(t,t)a(r',t LS ni d?r dir’ a | (57.10) 


Here, V-Pi = V-A = A and (r,t) = ew*(r,t)y(r,t). The last term in 
(57.10) is the internal coulomb energy of the electric charge distribution 
p(r,t); it results from the elimination of ¢ and the irrotational part of P 
and does not have to be inserted into the theory as a separate assumption. 

The solenoidal field vectors (P; and A) are usually referred to as the 
transverse part of the electromagnetic field, since the electric and magnetic 
field strengths in the corresponding plane-wave solutions are transverse 
to the direction of propagation, as in Sec. 56. The irrotational coulomb 
field vector (P;) is called the longitudinal part of the field, since Eq. (57.9) 
shows that the contribution to P, at one point from an infinitesimal 
element of charge at another point is along the line joining the two 
points. 
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QUANTIZATION OF THE FIELDS | 


We can now obtain a quantum theory of the interacting particle and 
electromagnetic fields by adopting the equation of motion (54.23), the 
particle field commutation or anticommutation relations (55.6) or (55.18), 
and the electromagnetic field commutation relations (56.21) with Pi sub- 
stituted for P. It will also be assumed that all components of y and yt 
commute with all components of A and P.. 

The order of factors like y* and y that are multiplied together in 
the hamiltonian (57.10) is immaterial in the unquantized case. In the 
quantum theory, however, yt and y do not commute with each other, and 
the result of a particular calculation will depend on the order in which 
they appear in the hamiltonian. We shall see later on in this section 
that a suitable hamiltonian is obtained if all terms are left as they are, 
except for a change in the integrand of the coulomb term. This integrand 
in (57.10) contains the term 


p(r,t) p(x’ ,t) = e*y" (r, tw(r,t)yt (r’ ,t)y(r’,t) (57. 1 1) 
which we replace by! ‘ 
ey (rA)yt(’ Ar’ W(t) (57.12) 


It ean be shown with the help of (55.6) and (55.18) that (57.11) is equal 
to (57.12) plus 


e*yt (r,ty(r’ t)d%(r — 1’) 
regardless of whether commutation or anticommutation relations are 
used. Thus the change from (57.11) to (57.12) is equivalent to the sub- 
traction from the hamiltonian (57.10) of the quantity 

2 t 1 4) 33(¢ — 7! 
“ [% (TQ) HO =F) 5, gap! 


Fo (57.13) 


This is evidently infinite unless y'(r,t)y(r,t) is zero; it will be shown below 
that this can happen only if there are no particles present in the field. 

The quantum equations of motion are obtained from (54.23) when 
the hamiltonian (57.10) is modified by substitution of (57.12) for (57.11). 
The electromagnetic field equations agree with those obtained in Sec. 56, 
except that P: replaces P and the electric current term appears as in the 
second of Eqs. (57.5). The particle field equation is the same as (44.1) 
except that ¢ is replaced by 

: i vr Da’, t) dr’ 

Fl 

It can then be shown that the time derivatives of the bracket expressions 


1 Note that both (57.11) and (57.12) give hermitian contributions to H. 
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in (55.6) or (55.18) and (56.21) are zero, so that these equations hold for 
all time if they are valid initially, as was assumed (see Prob. 19). 


INCLUSION OF STATIC FIELDS 


Thus far we have assumed that the electric charge and current densities 
arise entirely from charged particles that are described by the y field. 
The effect of a static charge distribution can easily be included by the 
addition of a term 4p, to the right side of the third of Eqs. (44.2), and 
a term e¢.y to the right side of (44.1), where V?6, = —4p,. It is not 
difficult to see that the only effect on the hamiltonian (57.10) is to add a 
term fed.y*y d®r. 

The situation of greatest practical interest is that for which 
ed, = —Ze?/r. This corresponds to an electron in the vicinity of a 
fixed (infinitely massive) point nucleus of atomic number Z, placed at 
the origin. With this addition and the modification (57.12), the quan- 
tized version of the hamiltonian (57.10) becomes 


w= fl (arta) } [Car -29) 


+ Vyty -— — v4 d*r + | ene =e = AY. x ay | d*r 
Sif eer vorvy ¥ 1 d*r d'r’ (57.14) 


where the primes in the last term indicate that the arguments are r’ 
rather thanr. Thus the nuclear coulomb interaction enters in the same 
way as a special choice of V, as would be expected. 


PERTURBATION THEORY OF THE INTERPARTICLE INTERACTION 


In the remainder of this section we shall consider two perturbation cal- 
culations based on the hamiltonian (57.14). First, we shall ignore the 
transverse part of the electromagnetic field (Pi; and A) and regard the 
coulomb interaction between particles [last term of (57.14)] as a perturba- 
tion. It is not necessary to retain both the V and Z terms, and we drop 
the latter in order to conform to the notation used in Secs. 23 and 55. 
This example is of interest since it shows how the replacement of (57.11) 
by (57.12) eliminates the infinite electrostatic self-energy of each charged 
particle, which is a carry-over from classical theory. It also shows how 
the exchange interaction between charged particles, including its depend- 
ence on particle statistics, emerges from the quantized field theory. 
Second, we shall consider the interaction of a single charged particle 
with the transverse electromagnetic field; in this case the last term of 
(57.14) does not contribute since only one particle is present. 
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The unperturbed hamiltonian (e = 0 and neglect of the transverse 
field) is just (55.5), so that the unperturbed states can be described by 
the kets |n1,no, ... me, ...) in the N representation. This means 
that y is expanded in accordance with (55.11). We choose the % to be 
eigenfunctions of (23.2) with eigenvalues E,, so that our kets are eigen- 
states of (55.5) with energy eigenvalues 27,.H,. Our object is to find the 
first-order perturbed energy that arises from the last term in (57.14). 
In accordance with the general theory of Sec. 31, this is the expectation 
value of 


ff COE ae ay 


‘ 
= = a;tay'aidm lr — x’ |-tuf (rug (r’)ui(r’)un(r) dr d3r’ 
ato 
(57.15) 


for an unperturbed ket. It is apparent from the first of Eqs. (55.15) in 
the Einstein-Bose case, or from the first of Eqs. (55.26) in the Fermi- 
Dirac case, that ajan|n1, . . . ) = 0 unless there are at least two particles 


present. With symmetric statistics, we require nm > 1 and n, > 1 if — 


lL ¥ m, or m > 2 if 1 = m, in order to get a nonvanishing result. With 
the exclusion principle, only the first case is possible. 

In similar fashion, it is apparent that the subtracted quantity (57.13) 
is equal to 


e? 


z > a,ta; i lr — r’| uk (r)ur(r’) 63(r — 1’) a*r d*r’ 
kl 


The integral associated with any term in this summation for which k = I 
is evidently infinite. Thus (57.13) gives an infinite contribution to the 
energy of the system unless a,|m, . . . ) = 0 for all k, that is, unless 
nm. = 0 for all & so that there are no particles present. This infinite 
contribution is the electrostatic self-energy of a classical point charge; 
it is eliminated from the quantum theory by subtraction of (57.13) from 
the field hamiltonian. 


EINSTEIN-BOSE CASE 


The expectation value or diagonal matrix element of (57.15) for a ket 
that corresponds to two or more Einstein-Bose particles is a sum of terms, 
each of which involves one or two occupied states. The term that refers 
to a single occupied state k, for example, contains the matrix element of 
axta,ta.a, = axt(a.a,t — 1)a, = Ni? — Ny. Thus the expectation value 
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of this part of (57.15) is 
i | 
5 MM ~— 1) ‘i / Ir — x’ Heer) |2]eun(r’)|2 a3 d3r’ (57.16) 


This is equal to zero if m = 1, and to e? times the volume integral if 
m% = 2, as expected (see also Prob. 20). 

The term that refers to different occupied states k and 1 contains the 
matrix elements of a,'ajta,a1, aita,ta,a, a,ta;tara,., and a:ta,taa,, each 
of which is equal to N,N:. Thus the expectation value of this part of 
(57.15) is 


e?ngmal f {|r — r’|-! ue (x)|?|ue(r’) |? d8r d2r’ 
+ ffir — rug @)ut (e)un(r’) u(r) d3r d3r"] (57.17) 


In the case m% = m = 1, this is just what would be expected from a 
symmetric two-particle wave function of the form (40.6). As with Eq. 
(41.8), the first term in (57.17) is called the direct or coulomb energy, 
and the second term the exchange energy. 


FERMI-DIRAC CASE 


The expectation value of (57.15) for a ket that corresponds to two or 
more Fermi-Dirac particles is again a sum of terms, each of which now 
involves two different occupied states & and J. Equations (55.19) show 
that . 


atataa = —atataa = —alataa, = atataa, = —N.N, 
Since m = m = 1, the expectation value of this part of (57.15) is 
e*[f fle — x’|-*ue(e)|?ur(e’) |? d8r dir’ 
— ffir — x’ uf @)ul(r’)u(e’) u(r) dr d3r’] (57.18) 


As would be expected from an antisymmetric two-particle wave function 
of the form (40.6), the direct and exchange energies have opposite signs 
in (57.18). 


RADIATION THEORY — 


As our final example, we now show how the absorption and emission of 
electromagnetic radiation by a charged particle can be obtained from 
quantum field theory. The calculation reproduces the principal results 
obtained from the semiclassical theory of Secs. 44 and 45 but places them 
on a sounder basis. As in Eqs. (44.13), we break up the hamiltonian 
into an unperturbed part Hy and a perturbation H’. Hp is now the sum 
of the particle hamiltonian (55.5) and the transverse part of the electro- 
magnetic hamiltonian (56.7), and can also be written in the N 
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Hy = E Vy - Vy + Voly + 2rc?P i? Teg a a x A)? |e 
= )} NiEi + » fick(Nx + 4) (57.19) 


H' is the remainder of the total hamiltonian (57.14). Certain 
parts of it can, however, be ignored. As before, we drop the Z term since 
we are keeping the V term. We also drop the last term since we are no 
longer concerned with the coulomb interaction between particles. Fur- 
ther, we note from the discussion of Eq. (56.35) that a term in the hamil- 
tonian linear in A is linear in the light quantum destruction and creation 
operators a, and a,', which respectively cause the absorption and 
emission of single photons. In similar fashion the A? term in (57.14) 
gives rise to the absorption and emission of pairs of photons and to the 
simultaneous absorption of one photon and emission of another. We 
therefore keep only the part of H’ that is linear in A. With the help of 
a partial integration, and making use of the fact that V-A = 0, we 
obtain 


, _ teh 


H’ = — | Via. Vy dir (57.20) 


We now change from the Heisenberg picture, in which the y’s and 
A depend on the time but the kets do not, to the interaction picture. 
To zero order in H’, the w’s and A, and hence also H’ itself, are the same 
in the interaction as in the Heisenberg picture. The kets are no longer 
strictly constant in time but show a secular change that is proportional 
to the zero-frequency part of H’ (compare with Secs. 35 and 37). We 
therefore write H’ in a form that shows its explicit dependence on the 
time. It follows from Eq. (55.11) and the solution to Prob. 5 that 
y = Tae“*t/*y,(r), where the argument ¢ = 0 has been dropped from 
a, Then on substituting this and the expansion (56.35) into (57.20), 
we obtain 


= Fy 7” » a;' Gite, * [ cnet pr Br ReREIA f exp (tk-r) u} Vur dr 


+ dy! eB —Erthck)tin f exp (—7k +r) uF Vu dr | (57.21) 


A typical a,, term in (57.21) corresponds to the destruction of a 
photon of energy hck = hw and a particle of energy FE; and the creation 
of a particle of energy E;; in other words, a photon is absorbed and the 
particle makes a transition from state 1 to state 7. Since, as remarked 
above, the kets are essentially time-independent, this process occurs only 
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if the frequency of the corresponding part of H’ is practically zero, that 
is, if H; ~ E, + hw. Similarly, a typical a,,' term corresponds to the 
emission of a photon, and this process occurs only if EF; ~ E, — hw. Thus 
energy conservation appears automatically as a result of the calculation. ! 


TRANSITION PROBABILITY FOR ABSORPTION 


We consider first a transition from a ket |@), in which there is a particle 
in the state / and a number of photons with a range of energies, to a ket 
|8) in which there is a particle in the state j (EZ; > E:) and one less photon 
of approximate energy EH; — E;. The matrix element of H’ for this 
transition is seen from (55.15) and (56.34) to be 
; E; — Ek, — ho 

(B|H’(t)|a) = (8|T s|a)e bat a aa (57.22) 
where we have separated out the time dependence by defining the first- 
order or Born approximation T matrix element 


teh, Ga 


3 
(B|T ala) = Pa i ) / uj exp (tKk+r) em: Vurd*r (57.23) 


Here nm is the number of photons of the type absorbed that are 
present in the initial ket ja). The transition probability per unit time 
can then be calculated as in Secs. 35 and 37: 


5m | f! BIH @) la) ae 


Ww 


2 


I 


2d #*K6Isl0) , | i cpa" dt! 


4 sin? Lwgat 
a 


= )#-((8|Tla))? 


KX 


(57.24) 


We now suppose, as in Sec. 44, that the particle states 7 and / are 
discrete and that the incident radiation can be described by an intensity 
I(w) dw for the infinitesimal frequency range dw. Then the summation 
over states k of the radiation field can be replaced by an integration over 
k or wea. Each photon contributes an amount hw/L* to the energy density 
or an amount fcw/L? to the intensity. We can therefore replace 


y4 ean by > (zoids (57.25) 
kA x 


hicw 


1 Although we have assumed in this discussion that the particles are bosons, it is 
apparent that the same results are obtained if they are fermions. Particle statistics 
cannot be significant as long as only one particle is present. 
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Combination of (57.23) to (57.25), together with the substitution of 
dwg. for dw, gives for a particular polarization » 


S7e? 


2 ¢ I(w) sin? dwgat 


w weet 


f uj exp (tk- 1) em + Vu d*r 


The ws. integrand is sharply peaked at ws. = 0, and so we proceed as in 
Sec. 35 to extend the limits to +o and make use of the result 


+} 
[. a7? sin? x dx = 1: 


_ 4re*I (w) 
m’cw? 


_ E; — E 
OR 
(57.26) 


| i; uj exp (tk+ 1) e+ Vu dr |” 


This is in agreement with Eq. (44.17). 


TRANSITION PROBABILITY FOR EMISSION 


We now use the at part of (57.21) to calculate the transition probability 

for emission of a photon. In order to make the result comparable with 

that obtained in the preceding subsection, we suppose that the particle 

destroyed is in the state 7 and that created is in the state 1, where again 
_E; > E,;. The T matrix element (57.23) is then replaced by 


} 
=H | eee / uy exp (—7tk- 1) em + Vu; d?r (57.27) 


Use has been made of the second of Eqs. (56.34), and it is important to 
remember that n,, is the number of photons of the type emitted that are 
present initially. A calculation similar to that which led from (57.23) to 
(57.26) gives for the emission probability per unit time 


2 
#1) | [uf exp (—se-1) a Yu dr | 
8 eh ‘ si at 
+) amtoLs| [wf exp (—ik 1) 0 Vasa pS dest (57.28) 


where wea is still given by (57.22). 

The two terms of (57.28) evidently arise from 7, and 1, respectively, 
in the factor (7m + 1)! of (57.27). The first term is proportional to the 
intensity of the incident radiation and agrees with the expression (44.19) 
for the induced emission probability. The second term is independent of 
the intensity of the radiation present initially ; we now show that it agrees 
with the spontaneous emission probability obtained in Sec. 45. 

We can simplify the second term in (57.28) by replacing the sum- 
mation over k with an integral over w or wgz. In order to do this, we 
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require an expression for the number of states of the electromagnetic field 
that lie in the angular frequency range w to w + dw. It follows from the 
discussion of periodic boundary conditions in Sec. 11 that there are 
(L/21)* dk, dk, dk, plane waves with propagation vectors in the range 
dk, dk, dk,. Thus if we specify the direction of the propagation vector k 
by the polar angles 6, @ with respect to some fixed set of axes, the number 
of plane waves with angular frequency in the range dw that have direc- 
tions of propagation in the angular range dé d¢ is 


(L3w?/823c?) sin 6 dé dd dw 


The integration over w in the second term of (57.28) can be carried out as 
before; the result is 


Jo So" semi 


The expression to the right of the summation in (57.29) is the 
probability per unit time for spontaneous emission of a light quantum 
with propagation vector k in the angular range d@ d¢ and with polariza- 
tion A. Thus (57.29) is the total spontaneous emission probability 
associated with the transition 71. In order to compare it with the 
corresponding expression in Sec. 45, we specialize to the dipole case by 
replacing exp (—7k-r) by unity and V by —(mw/h)r in the integrand of 
the matrix element [see Eq. (44.20)]: 


2a e2u3 

ip Io 4 Qarhic? 
The two directions of polarization for each propagation vector can be 
chosen in any way so long as they are perpendicular to each other and to 
k. If we choose one in the plane of k and the matrix element (I|r|j), and 
the other perpendicular to this plane, only the former is emitted; then a 
factor sin? @ appears in the integrand, where @ is the angle between k and 
(ijr|j). Thus the polarization and angular distribution of the emitted 


radiation are in agreement with those found in Sec. 45. The total 
spontaneous dipole emission probability obtained from (57.30) is 


ih i oe sin? @ (fut Tu; d%r)* ‘ ([utru; d*r) sin 6 d0 d@ 


4e? 
= SE lela? (7.31) 


/ uy exp (—7k+ 1) ex, > Vu; dr * sin 6 de do 
(57.29) 


fxn * fut Tu; dy 


sin 6d6d¢ (57.30) 


which agrees with (45.22). 
Both the spontaneous and induced emission probabilities appear as 
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the result of a single calculation in quantum field theory, whereas these 
two effects were calculated in quite different ways in Chap. 11. As was 
pointed out above, the spontaneous emission arises from the 1 in the 
factor (vx, + 1)! that appears in the second of Eqs. (56.34). This in 
turn comes from the commutation relations (56.28) and hence is a purely 
quantum-mechanical effect. From a formal point of view, we can say 
that the spontaneous emission probability is equal to the probability of 
emission that would be induced by the presence of one quantum in each 
state of the electromagnetic field (see also Prob. 6, Chap. 11). Now we 
have already seen in Eq. (56.32) that the smallest possible energy of the 
field corresponds to the presence of one-half quantum per state. This 
suggests that we regard the spontaneous emission as being induced by the 
zero-point oscillations of the electromagnetic field; note, however, that 
these oscillations are twice as effective in producing emissive transitions 
as are real photons and are of course incapable of producing absorptive 
transitions. 


PROBLEMS 


1. Make use of Eqs. (54.12) to (54.14) to show that the classical hamiltonian equa- 
tions of motion for a field agree with the lagrangian equations in the cell 
approximation. 

2. Show that Eq. (54.20) is valid even if the functional density F depends on Vy and 
Vr as well as on y and z=. 

3. The wave amplitude y(r,,t) can be regarded as a functional to which corresponds 
the functional density y(r,t)6%(r — ro), and similarly for 7. Show that Eq. (54.20) 
gives the correct equations of motion for y and x when use is made of these functional 
densities. 

4. Extend (55.11) and the following equations to the situation in which the uz 
constitute a continuous set of functions. Then make use of Prob. 2, Chap. 3, to show 
that the measured number of particles in the infinitesimal neighborhood of any point 
is a positive integer or zero. 

5. Find the equation of motion of the Heisenberg-picture operator a,(t) defined by 
Eq. (55.11) when H has the form (55.17). Integrate this equation to find a;(t) in 
terms of a;(0). 

6. Show that the anticommutator bracket has different algebraic properties from 
the commutator bracket and the Poisson bracket. 


7. Show that N is a constant of the motion for nonrelativistic Fermi-Dirac particles. 
8. Consider asystem of Fermi-Dirac particles for which there are two states. Obtain 
explicitly the matrices and kets that are analogous to (55.23) and (55.24). 

9. Show that [p(r,t),o(r’,t)] = 0 for both quantizations of the nonrelativistic Schréd- 
inger equation, where p(1,t) = ey (r,t)y (1,1). 

10. Calculate the commutator bracket of [V X A(r,t)]? and P,(r’,t). Use the result 
to show that the equation of motion for the quantity P in a vacuum electromagnetic 
field is given by the second of Eqs. (56.6). 
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11. Show that if the commutator bracket of two cartesian components of A and P 
is given by 
(r!,t)] = thd.88(0 — -~£23- (R=) 

[A,(r,),P.(2’,t)] = 1h5,.5%(r — r) de Or, ary AFT 
then A and ¥V - P at different space points commute with each other. 
12. Show that the replacement of the last of the commutation relations (56.8) by 
that of Prob. 11 does not affect those commutator brackets that involve the field 
strengths. 
13. Show that v-H commutes with the electromagnetic-field hamiltonian (56.15) 
and hence is a constant of the motion. 
14. Carry through the calculations implied in Eqs. (56.16), and show that the first 
two of Maxwell’s equations are obtained. 
15. Calculate the commutator bracket of each of the bracket expressions in (56.18) 
with the hamiltonian (56.22), and show that the bracket expressions are constant in 
time. This means that these commutation laws are consistent with the equations 
of motion. 
16. Derive the expression (56.43) for the commutator bracket of E,(r,t) and H,(r’,t’). 
17. Obtain the lagrangian equations of motion for y, ¥*, A, and ¢ from Eq. (57.3). 
18. Find the quantum equation of motion for y when the hamiltonian is given by 
(57.10) with the substitution of (57.12) for (57.11). Show that it agrees with (44.1) 
if ¢ is replaced by ef|r — r’[—1yt(r’,t)y (r’,¢) d*r’. What result is obtained if the above 
substitution is not made? 
19. Show that the bracket expressions in (55.6) or (55.18) and (56.21) are constant 
in time for the hamiltonian of Prob. 18. 
20. Show that the expression (57.16) is obtained from the theory of Sec. 40 if there 
are n; Einstein-Bose particles in the same state k, for any value of n,. 
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